Abstract

In this paper, we consider a general time-inconsistent optimal control problem for a non-homogeneous linear system whose state evolves according to a stochastic differential equation with deterministic coefficients, where the noise is driven by a Brownian motion and an independent Poisson point process.

The running and terminal costs in the objective functional depend explicitly on general discounting coefficients, which cover the non-exponential and hyperbolic discounting situations. Furthermore, the presence of quadratic terms in the conditional expectation of the state process, as well as a state-dependent term in the objective functional, makes the problem time-inconsistent. Open-loop Nash equilibrium controls are constructed instead of optimal controls by using a version of the stochastic maximum principle. This approach involves a stochastic system that consists of a flow of forward-backward stochastic differential equations and an equilibrium condition. As an application, we study some concrete examples.

Keywords: Stochastic maximum principle, time inconsistency, linear quadratic control problem, equilibrium control, variational inequality.

1 Introduction

Time-inconsistent stochastic control problems have received considerable attention in recent years. The risk-aversion attitude of a mean-variance investor (see, e.g., [9]) and portfolio optimization with a non-exponential discount function (see [6] and [7]) provide two well-known examples of time-inconsistency in mathematical finance. Motivated by these practical examples, this paper studies optimality conditions for a time-inconsistent linear quadratic stochastic control problem, where the state is described by an $n$-dimensional non-homogeneous controlled SDE with jumps, defined on a complete filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_t)_{t\in[0,T]},\mathbb{P})$:


$$
\begin{cases}
dX(s) = \{A(s)X(s) + B(s)u(s) + b(s)\}\,ds + \sum_{j=1}^{d}\{C_j(s)X(s) + D_j(s)u(s) + \sigma_j(s)\}\,dW^j(s) \\
\qquad\qquad + \int_Z \{E(s,z)X(s-) + F(s,z)u(s) + c(s,z)\}\,\tilde N(ds,dz), \quad s\in[0,T], \\
X(0) = x_0 \in \mathbb{R}^n.
\end{cases}
\tag{1.1}
$$

The coefficients $A(\cdot)$, $B(\cdot)$, $b(\cdot)$, $C_j(\cdot)$, $D_j(\cdot)$, $\sigma_j(\cdot)$, $E(\cdot,\cdot)$, $F(\cdot,\cdot)$ and $c(\cdot,\cdot)$ are deterministic matrix-valued functions of suitable sizes. As time evolves, it is natural to consider the linear controlled stochastic differential equation starting from the situation $(t,\xi)\in[0,T]\times\mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^n)$:

$$
\begin{cases}
dX(s) = \{A(s)X(s) + B(s)u(s) + b(s)\}\,ds + \sum_{j=1}^{d}\{C_j(s)X(s) + D_j(s)u(s) + \sigma_j(s)\}\,dW^j(s) \\
\qquad\qquad + \int_Z \{E(s,z)X(s-) + F(s,z)u(s) + c(s,z)\}\,\tilde N(ds,dz), \quad s\in[t,T], \\
X(t) = \xi.
\end{cases}
\tag{1.2}
$$


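Under some conditions, for any initial situation $(t,\xi)$ and any admissible control $u(\cdot)$, the state equation is uniquely solvable; we denote its solution by $X(\cdot) = X^{t,\xi}(\cdot;u(\cdot))$, for $s\in[t,T]$. Different controls $u(\cdot)$ will lead to different solutions $X(\cdot)$. To measure the performance of $u(\cdot)$, we introduce the following cost functional:

$$
\begin{aligned}
J(t,\xi,u(\cdot)) = \mathbb{E}^t\Big[ &\int_t^T \tfrac12\big(\langle Q(t,s)X(s),X(s)\rangle + \langle \bar Q(t,s)\mathbb{E}^t[X(s)],\mathbb{E}^t[X(s)]\rangle + \langle R(t,s)u(s),u(s)\rangle\big)\,ds \\
&+ \langle \mu_1(t)\xi + \mu_2(t), X(T)\rangle \\
&+ \tfrac12\big(\langle G(t)X(T),X(T)\rangle + \langle \bar G(t)\mathbb{E}^t[X(T)],\mathbb{E}^t[X(T)]\rangle\big)\Big].
\end{aligned}
\tag{1.3}
$$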


The coefficients $Q(\cdot,\cdot)$, $\bar Q(\cdot,\cdot)$, $R(\cdot,\cdot)$, $G(\cdot)$, $\bar G(\cdot)$, $\mu_1(\cdot)$ and $\mu_2(\cdot)$ are deterministic matrix-valued functions of suitable sizes, which depend explicitly on the initial time $t$ in a general way. Our objective in this paper is to investigate a general discounting linear quadratic optimal control problem for jump diffusions, which is time-inconsistent in the sense that it does not satisfy the Bellman optimality principle: the restriction of an optimal control for a specific initial pair to a later time interval might not be optimal for the corresponding initial pair. The novelty of this work lies in the fact that our calculations are not limited to the exponential discounting framework. The time-inconsistency of the LQ optimal controls that we consider is due to the presence of general discounting coefficients, including the so-called hyperbolic discounting situations. In addition, the presence of quadratic terms of the expected controlled state process, in either the running cost or the terminal cost, makes the problem time-inconsistent; this can be motivated by the reward term in the mean-variance portfolio choice model. The term $\mu_1(t)\xi + \mu_2(t)$ stems from a state-dependent utility function in economics [9]. Each of these terms introduces time-inconsistency into the underlying model in a somewhat different way.

The main difficulty when facing a time-inconsistent optimal control problem is that, in general, we cannot use dynamic programming and the standard HJB techniques. The main approach to handling time-inconsistent optimal control problems is to view them within a game-theoretic framework: Nash equilibria are considered instead of optimal solutions. The fundamental idea is that the control action the controller takes at every instant of time is considered as a game against all the control actions that the future incarnations of the controller are going to take. Strotz [19] was the first to use this game perspective to handle the dynamic time-inconsistent decision problem in the deterministic Ramsey problem. Then, by capturing the idea of non-commitment, letting the commitment period be infinitesimally small, he characterized a Nash equilibrium strategy. Further works extending this line of research are [10], [16], [15] and [8]. Ekeland and Lazrak [6] and Ekeland and Pirvu [7] apply this game perspective to investigate the optimal investment-consumption problem under general discount functions, in both the deterministic and the stochastic framework. By means of the so-called "local spike variation", they provide a formal definition of feedback Nash equilibrium controls in continuous time. The work [5] extends the idea to the stochastic framework where the controlled process is Markovian. In addition, an extended HJB equation is derived, along with a verification argument that characterizes a Markov subgame perfect Nash equilibrium. In [20], Yong studied a time-inconsistent deterministic linear quadratic model and derived a closed-loop equilibrium strategy via a forward ordinary differential equation coupled with a backward Riccati-Volterra integral equation. Hu et al. [9] studied a time-inconsistent stochastic linear-quadratic control model, which originates from the mean-variance portfolio selection problem with state-dependent risk aversion; by means of a variational method, they derived a general sufficient condition for equilibria through a new class of forward-backward stochastic differential equations (FBSDEs in short), along with some equilibrium conditions. In [21], Yong investigated a time-inconsistent stochastic optimal control problem for stochastic differential equations. By introducing a family of N-person non-cooperative differential games, he characterized a closed-loop equilibrium strategy.

The purpose of this paper is to characterize Nash equilibrium controls for a general time-inconsistent stochastic linear quadratic optimal control problem. The objective functional includes the cases of hyperbolic discounting as well as the continuous-time Markowitz mean-variance portfolio selection problem with state-dependent risk aversion. We emphasize that our model covers a class of time-inconsistent stochastic LQ optimal control problems studied in [9], and some relevant cases that appeared in [20]. Note that in [9] the weighting matrices do not depend on the current time $t$, and in [20] the terminal cost does not depend on the current state $\xi$. Moreover, we define the equilibrium controls in the open-loop sense (in a manner similar to [9]), which is different from the feedback form (see, e.g., [2], [3], [5], [6], [7], [12] and [13]).

The rest of the paper is organized as follows. In Section 2, we describe the model and formulate the objective. In Section 3, we present the first main result of this work (Theorem 3.2), which characterizes the equilibrium control via a stochastic system involving a flow of forward-backward stochastic differential equations with jumps (FBSDEJ in short), along with some equilibrium conditions. In Section 4, by decoupling the flow of the FBSDEJ, we investigate a feedback representation of the equilibrium control via a class of ordinary differential equations which do not have a symmetric structure. Section 5 is devoted to some applications: we solve a continuous-time mean-variance portfolio selection model and some one-dimensional general discounting LQ problems. The paper ends with an appendix containing some proofs.


2 Problem setting 


Let $(\Omega,\mathcal{F},(\mathcal{F}_t)_{t\in[0,T]},\mathbb{P})$ be a filtered probability space such that $\mathcal{F}_0$ contains all $\mathbb{P}$-null sets, $\mathcal{F}_T = \mathcal{F}$ for an arbitrarily fixed finite time horizon $T>0$, and $(\mathcal{F}_t)_{t\in[0,T]}$ satisfies the usual conditions. We assume that $(\mathcal{F}_t)_{t\in[0,T]}$ is generated by a $d$-dimensional standard Brownian motion $(W(t))_{t\in[0,T]}$ and an independent Poisson measure $N$ on $[0,T]\times Z$, where $Z\subset\mathbb{R}\setminus\{0\}$. We assume that the compensator of $N$ has the form $\mu(dt,dz) = \theta(dz)\,dt$ for some positive and $\sigma$-finite Lévy measure $\theta$ on $Z$, endowed with its Borel $\sigma$-field $\mathcal{B}(Z)$. We suppose that $\int_Z 1\wedge|z|^2\,\theta(dz) < \infty$ and write $\tilde N(dt,dz) = N(dt,dz) - \theta(dz)\,dt$ for the compensated jump martingale random measure of $N$. Obviously, we have

$$
\mathcal{F}_t = \sigma\Big[\iint_{A\times(0,s]} N(dr,de);\ s\le t,\ A\in\mathcal{B}(Z)\Big] \vee \sigma\big[W(s);\ s\le t\big] \vee \mathcal{N},
$$

where $\mathcal{N}$ denotes the totality of $\mathbb{P}$-null sets, and $\sigma_1\vee\sigma_2$ denotes the $\sigma$-field generated by $\sigma_1\cup\sigma_2$.


2.1 Notations 

Throughout this paper, we use the following notations: 

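• $S^n$: the set of $n\times n$ symmetric real matrices.

• $C^\top$: the transpose of the vector (or matrix) $C$.

• $\langle\cdot,\cdot\rangle$: the inner product in some Euclidean space.

For any Euclidean space $H = \mathbb{R}^n$, $\mathbb{R}^{n\times m}$ or $S^n$ with Frobenius norm $|\cdot|$, we let, for any $t\in[0,T]$:

• $\mathbb{L}^p(\Omega,\mathcal{F}_t,\mathbb{P};H) := \{\xi:\Omega\to H \mid \xi \text{ is } \mathcal{F}_t\text{-measurable, with } \mathbb{E}[|\xi|^p] < \infty\}$, for any $p\ge 1$.

• $\mathbb{L}^2(Z,\mathcal{B}(Z),\theta;H) := \{r(\cdot): Z\to H \mid r(\cdot) \text{ is } \mathcal{B}(Z)\text{-measurable, with } \int_Z |r(z)|^2\,\theta(dz) < \infty\}$.

• $\mathcal{S}^2_{\mathcal{F}}(t,T;H) := \{X(\cdot): [t,T]\times\Omega\to H \mid X(\cdot) \text{ is } (\mathcal{F}_s)_{s\in[t,T]}\text{-adapted, } s\mapsto X(s) \text{ is càdlàg, with } \mathbb{E}[\sup_{s\in[t,T]}|X(s)|^2] < \infty\}$.

• $\mathcal{L}^2_{\mathcal{F}}(t,T;H) := \{X(\cdot): [t,T]\times\Omega\to H \mid X(\cdot) \text{ is } (\mathcal{F}_s)_{s\in[t,T]}\text{-adapted, with } \mathbb{E}[\int_t^T |X(s)|^2\,ds] < \infty\}$.

• $\mathcal{L}^{\theta,2}_{\mathcal{F}}([t,T]\times Z;H) := \{R(\cdot,\cdot): [t,T]\times\Omega\times Z\to H \mid R(\cdot,\cdot) \text{ is an } (\mathcal{F}_s)_{s\in[t,T]}\text{-adapted measurable process on } [t,T]\times\Omega\times Z, \text{ with } \mathbb{E}[\int_t^T\!\int_Z |R(s,z)|^2\,\theta(dz)\,ds] < \infty\}$.

• $C([0,T];H) := \{f: [0,T]\to H \mid f(\cdot) \text{ is continuous}\}$.

• $D[0,T] := \{(t,s)\in[0,T]\times[0,T] \text{ such that } s\ge t\}$.

• $C(D[0,T];H) := \{f(\cdot,\cdot): D[0,T]\to H \mid f(\cdot,\cdot) \text{ is continuous}\}$.

• $C^{0,1}(D[0,T];H) := \{f(\cdot,\cdot): D[0,T]\to H \mid f(\cdot,\cdot) \text{ and } \tfrac{\partial f}{\partial s}(\cdot,\cdot) \text{ are continuous}\}$.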

2.2 Problem statement 

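We consider an $n$-dimensional non-homogeneous linear controlled jump diffusion system

$$
\begin{cases}
dX(s) = \{A(s)X(s) + B(s)u(s) + b(s)\}\,ds + \sum_{j=1}^{d}\{C_j(s)X(s) + D_j(s)u(s) + \sigma_j(s)\}\,dW^j(s) \\
\qquad\qquad + \int_Z \{E(s,z)X(s-) + F(s,z)u(s) + c(s,z)\}\,\tilde N(ds,dz), \quad s\in[t,T], \\
X(t) = \xi,
\end{cases}
\tag{2.1}
$$

where $(t,\xi,u(\cdot))\in[0,T]\times\mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^n)\times\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)$. Note that $\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)$ is the space of all admissible strategies. Our aim is to minimize the following expected discounted cost functional

$$
\begin{aligned}
J(t,\xi,u(\cdot)) = \mathbb{E}^t\Big[ &\int_t^T \tfrac12\big(\langle Q(t,s)X(s),X(s)\rangle + \langle \bar Q(t,s)\mathbb{E}^t[X(s)],\mathbb{E}^t[X(s)]\rangle + \langle R(t,s)u(s),u(s)\rangle\big)\,ds \\
&+ \langle \mu_1(t)\xi + \mu_2(t), X(T)\rangle \\
&+ \tfrac12\big(\langle G(t)X(T),X(T)\rangle + \langle \bar G(t)\mathbb{E}^t[X(T)],\mathbb{E}^t[X(T)]\rangle\big)\Big]
\end{aligned}
\tag{2.2}
$$

over $u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)$, where $X(\cdot) = X^{t,\xi}(\cdot;u(\cdot))$ and $\mathbb{E}^t[\cdot] = \mathbb{E}[\cdot\mid\mathcal{F}_t]$.

We need to impose the following assumptions on the coefficients.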


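(H1) The functions $A(\cdot), C_j(\cdot): [0,T]\to\mathbb{R}^{n\times n}$, $B(\cdot), D_j(\cdot): [0,T]\to\mathbb{R}^{n\times m}$, $b(\cdot), \sigma_j(\cdot): [0,T]\to\mathbb{R}^n$, $E(\cdot,\cdot): [0,T]\times Z\to\mathbb{R}^{n\times n}$, $F(\cdot,\cdot): [0,T]\times Z\to\mathbb{R}^{n\times m}$, and $c(\cdot,\cdot): [0,T]\times Z\to\mathbb{R}^n$ are continuous, and the coefficients of the cost functional satisfy

$$
\begin{cases}
Q(\cdot,\cdot),\ \bar Q(\cdot,\cdot)\in C(D[0,T];S^n), \\
R(\cdot,\cdot)\in C(D[0,T];S^m), \\
G(\cdot),\ \bar G(\cdot)\in C([0,T];S^n), \\
\mu_1(\cdot)\in C([0,T];\mathbb{R}^{n\times n}), \\
\mu_2(\cdot)\in C([0,T];\mathbb{R}^n).
\end{cases}
$$

(H2) The functions $R(\cdot,\cdot)$, $Q(\cdot,\cdot)$ and $G(\cdot)$ satisfy

$$
R(t,t)\ge 0,\quad G(t)\ge 0,\quad \forall t\in[0,T], \quad\text{and}\quad Q(t,s)\ge 0,\quad \forall (t,s)\in D[0,T].
$$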


Under (H1), for any $(t,\xi,u(\cdot))\in[0,T]\times\mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^n)\times\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)$, the state equation (2.1) has a unique solution $X(\cdot)\in\mathcal{S}^2_{\mathcal{F}}(t,T;\mathbb{R}^n)$, see for example [11]. Moreover, we have the following estimate

$$
\mathbb{E}\Big[\sup_{t\le s\le T}|X(s)|^2\Big] \le K\big(1 + \mathbb{E}[|\xi|^2]\big),
$$

for some positive constant $K$. The optimal control problem can be formulated as follows.

Problem (LQJ). For any given initial pair $(t,\xi)\in[0,T]\times\mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^n)$, find a control $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)$ such that

$$
J(t,\xi,\hat u(\cdot)) = \min_{u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^m)} J(t,\xi,u(\cdot)).
$$

Remark 2.1. 1) The dependence of the weighting matrices on the current time $t$, the dependence of the terminal cost on the current state $\xi$, and the presence of quadratic terms of the expected controlled state process in the cost functional make Problem (LQJ) time-inconsistent.

2) One way to get around the time-inconsistency issue is to consider only precommitted controls (i.e., controls that are optimal only when viewed at the initial time).
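
As a rough numerical illustration of the state equation (2.1), the following sketch simulates a scalar instance ($n = m = d = 1$) with an Euler scheme. Everything specific in it is an assumption made for illustration and is not taken from the paper: the constant coefficient values, a finite Lévy measure $\theta = \lambda\,\delta_{z_0}$ (a single jump size arriving with intensity `lam`), the Bernoulli approximation of the Poisson increment, and the hypothetical names `simulate_state` and `u`.

```python
import math
import random

def simulate_state(x0, u, T=1.0, steps=1000, seed=0,
                   A=-0.5, B=1.0, b=0.1, C=0.2, D=0.1, sigma=0.05,
                   E=0.1, F=0.05, c=0.0, lam=2.0):
    """Euler scheme for a scalar instance of the state equation (2.1).

    Assumption: theta = lam * delta_{z0}, so the compensated jump term is
    (E*X + F*u + c) * (dN - lam*dt), with N a Poisson process of rate lam.
    `u` is a callable (s, x) -> control value.
    """
    rng = random.Random(seed)
    dt = T / steps
    x, path = x0, [x0]
    for k in range(steps):
        s = k * dt
        uk = u(s, x)
        dW = rng.gauss(0.0, math.sqrt(dt))
        # Bernoulli approximation of the Poisson increment on a small step
        dN = 1.0 if rng.random() < lam * dt else 0.0
        x = (x
             + (A * x + B * uk + b) * dt
             + (C * x + D * uk + sigma) * dW
             + (E * x + F * uk + c) * (dN - lam * dt))
        path.append(x)
    return path

# Example: a linear feedback control chosen arbitrarily for the demonstration
path = simulate_state(x0=1.0, u=lambda s, x: -0.5 * x)
print(len(path), path[-1])
```

The scheme is only meant to make the three driving terms (drift, Brownian part, compensated jumps) concrete; it says nothing about equilibrium controls.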


2.3 An example of time-inconsistent optimal control problem 

We present a simple illustration of a stochastic optimal control problem which is time-inconsistent. Our aim is to show that the classical SMP approach is not effective in the study of this problem if it is viewed as time-consistent. For $n = d = 1$, consider the following controlled SDE starting from $(t,x)\in[0,T]\times\mathbb{R}$:

$$
\begin{cases}
dX^{t,x}(s) = b\,u(s)\,ds + \sigma\,dW(s), \quad s\in[t,T], \\
X^{t,x}(t) = x,
\end{cases}
\tag{2.3}
$$

where $b$ and $\sigma$ are real constants. The cost functional is given by

$$
J(t,x,u(\cdot)) = \frac12\,\mathbb{E}\Big[\int_t^T |u(s)|^2\,ds + h(t)\big(X^{t,x}(T) - x\big)^2\Big],
\tag{2.4}
$$

where $h(\cdot): [0,T]\to(0,\infty)$ is a general deterministic non-exponential discount function satisfying $h(0) = 1$, $h(s) > 0$ and $\int_0^T h(t)\,dt < \infty$. We want to address the following stochastic control problem.

Problem (E). For any given initial pair $(t,x)\in[0,T]\times\mathbb{R}$, find a control $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R})$ such that

$$
J(t,x,\hat u(\cdot)) = \min_{u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R})} J(t,x,u(\cdot)).
$$


At a first stage, we consider Problem (E) as a standard time-consistent stochastic linear quadratic problem. Since $J(t,x,\cdot)$ is convex and coercive, there exists a unique optimal control for this problem for each fixed initial pair $(t,x)\in[0,T]\times\mathbb{R}$. Notice that the usual Hamiltonian associated with this problem is $H: [0,T]\times\mathbb{R}^4\to\mathbb{R}$ such that, for every $(s,y,v,p,q)\in[0,T]\times\mathbb{R}^4$, we have

$$
H(s,y,v,p,q) = pbv + \sigma q - \tfrac12 v^2.
$$

Let $u^{t,x}(\cdot)$ be an admissible control for $(t,x)\in[0,T]\times\mathbb{R}$. Then the corresponding first-order and second-order adjoint equations are given respectively by

$$
\begin{cases}
dp^{t,x}(s) = q^{t,x}(s)\,dW(s), \quad s\in[t,T], \\
p^{t,x}(T) = -h(t)\big(X^{t,x}(T) - x\big),
\end{cases}
$$

and

$$
\begin{cases}
dP^{t,x}(s) = Q^{t,x}(s)\,dW(s), \quad s\in[t,T], \\
P^{t,x}(T) = -h(t);
\end{cases}
$$

the last equation has the unique solution $(P^{t,x}(s),Q^{t,x}(s)) = (-h(t),0)$, $\forall s\in[t,T]$.

Note that the corresponding $\mathcal{H}$-function is given by

$$
\mathcal{H}(s,y,v) = H\big(s,y,v,p^{t,x}(s),q^{t,x}(s)\big) = p^{t,x}(s)\,bv + \sigma q^{t,x}(s) - \tfrac12 v^2,
$$

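which is a concave function of $v$. Then, according to the sufficient condition of optimality, see e.g. Theorem 5.2, p. 138 in [22], for any fixed initial pair $(t,x)\in[0,T]\times\mathbb{R}$, Problem (E) is uniquely solvable with an optimal control $\hat u^{t,x}(\cdot)$ having the representation

$$
\hat u^{t,x}(s) = b\,\hat p^{t,x}(s), \quad \forall s\in[t,T],
$$

such that the process $(\hat p^{t,x}(\cdot),\hat q^{t,x}(\cdot))$ is the unique adapted solution to the BSDE

$$
\begin{cases}
d\hat p^{t,x}(s) = \hat q^{t,x}(s)\,dW(s), \quad s\in[t,T], \\
\hat p^{t,x}(T) = -h(t)\big(\hat X^{t,x}(T) - x\big).
\end{cases}
$$

By standard arguments we can show that the processes $(\hat p^{t,x}(\cdot),\hat q^{t,x}(\cdot))$ are explicitly given by

$$
\begin{cases}
\hat p^{t,x}(s) = -M^t(s)\big(\hat X^{t,x}(s) - x\big), \quad s\in[t,T], \\
\hat q^{t,x}(s) = -\sigma M^t(s), \quad s\in[t,T],
\end{cases}
\qquad M^t(s) = \frac{h(t)}{b^2h(t)(T-s)+1},
$$

where $\hat X^{t,x}(\cdot)$ is the solution of the state equation corresponding to $\hat u^{t,x}(\cdot)$, given by

$$
\begin{cases}
d\hat X^{t,x}(s) = b\,\hat u^{t,x}(s)\,ds + \sigma\,dW(s), \quad s\in[t,T], \\
\hat X^{t,x}(t) = x.
\end{cases}
$$

A simple computation shows that

$$
\hat u^{t,x}(s) = -\frac{b\,h(t)}{b^2h(t)(T-s)+1}\big(\hat X^{t,x}(s) - x\big), \quad \forall s\in[t,T];
$$

clearly we have

$$
\hat u^{t,x}(s) \neq 0, \quad \forall s\in(t,T].
\tag{2.5}
$$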

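In the next stage, we will see that Problem (E) is time-inconsistent. For this, we first fix the initial data $(t,x)\in[0,T]\times\mathbb{R}$. Suppose that Problem (E) is time-consistent, in the sense that for any $r\in[t,T]$ the restriction of $\hat u^{t,x}(\cdot)$ to $[r,T]$ is optimal for Problem (E) with initial pair $(r,\hat X^{t,x}(r))$. Since Problem (E) is uniquely solvable for any initial pair, we should then have, $\forall r\in(t,T]$,

$$
\hat u^{t,x}(s) = \hat u^{r,\hat X^{t,x}(r)}(s) = -\frac{b\,h(r)}{b^2h(r)(T-s)+1}\Big(\hat X^{r,\hat X^{t,x}(r)}(s) - \hat X^{t,x}(r)\Big), \quad \forall s\in[r,T],
$$

where $\hat X^{r,\hat X^{t,x}(r)}(\cdot)$ solves the SDE

$$
\begin{cases}
d\hat X^{r,\hat X^{t,x}(r)}(s) = -\dfrac{b^2h(r)}{b^2h(r)(T-s)+1}\Big(\hat X^{r,\hat X^{t,x}(r)}(s) - \hat X^{t,x}(r)\Big)\,ds + \sigma\,dW(s), \quad s\in[r,T], \\
\hat X^{r,\hat X^{t,x}(r)}(r) = \hat X^{t,x}(r).
\end{cases}
$$

In particular, by the uniqueness of the solution to the state SDE, we should have

$$
\hat u^{t,x}(r) = -\frac{b\,h(r)}{b^2h(r)(T-r)+1}\Big(\hat X^{r,\hat X^{t,x}(r)}(r) - \hat X^{t,x}(r)\Big) = 0, \quad \forall r\in(t,T],
$$

that is, the zero control would be the only optimal solution of Problem (E), which contradicts (2.5). Therefore, Problem (E) is not time-consistent. More precisely, treating it as time-consistent would mean that the solution obtained by the classical SMP is wrong and that the problem is rather trivial, since the only optimal control would be identically zero.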
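
The contradiction above can be made concrete with a few lines of arithmetic. The sketch below evaluates the precommitted feedback formula for $\hat u^{t,x}$ twice: once as planned at time $t$ and evaluated at a later time $r$, and once as re-planned at $r$ from the state reached there. The hyperbolic discount $h(t) = 1/(1+t)$, the numerical values, and the function names are assumptions chosen only for illustration.

```python
def h(t):
    """Hyperbolic discount function (an assumed example): h(0) = 1, h > 0."""
    return 1.0 / (1.0 + t)

def gain(t, s, b=1.0, T=1.0):
    """Feedback gain k(t, s), so that u^{t,x}(s) = -k(t, s) * (X(s) - x)."""
    return b * h(t) / (b * b * h(t) * (T - s) + 1.0)

def precommitted_control(t, x, s, X_s, b=1.0, T=1.0):
    """Control at time s planned at time t from state x, given the state X_s."""
    return -gain(t, s, b, T) * (X_s - x)

t, x = 0.0, 0.0       # initial pair
r, X_r = 0.5, 0.3     # a later time and a hypothetical state reached at r

planned = precommitted_control(t, x, r, X_r)      # plan made at t, used at r
replanned = precommitted_control(r, X_r, r, X_r)  # fresh plan made at r
print(planned, replanned)
```

The plan made at $t$ prescribes a nonzero action at $r$ whenever the state has moved away from $x$, while the controller restarting at $r$ measures deviations from the new reference point $\hat X^{t,x}(r)$ with the new weight $h(r)$ and therefore prescribes zero at that instant. The two plans disagree, which is the time-inconsistency described in the text.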


3 Characterization of equilibrium strategies 

The purpose of this paper is to characterize open-loop Nash equilibria instead of optimal controls. We use the game-theoretic approach to handle the time inconsistency, in the same perspective as Ekeland and Lazrak [6] and Björk and Murgoci [5]. Let us briefly describe the game perspective that we will consider.

• We consider a game with one player at each point $t$ in $[0,T]$. This player represents the incarnation of the controller at time $t$ and is referred to as "player $t$".

• The $t$-th player can control the system only at time $t$ by taking his/her strategy $u(t,\cdot): \Omega\to\mathbb{R}^m$.

• A control process $u(\cdot)$ is then viewed as a complete description of the chosen strategies of all players in the game.

• The reward to player $t$ is given by the functional $J(t,u(\cdot))$. Note that $J(t,u(\cdot))$ depends only on the restriction of the control $u(\cdot)$ to the time interval $[t,T]$.

The above description leads to the concept of a "Nash equilibrium point" of the game. This is an admissible control process $\hat u(\cdot)$ satisfying the following condition: suppose that every player $s$, with $s > t$, uses the strategy $\hat u(s)$. Then the optimal choice for player $t$ is to also use the strategy $\hat u(t)$.

Nevertheless, the problem with this "definition" is that the individual player $t$ does not really influence the outcome of the game at all. He/she only chooses the control at the single point $t$, and since this is a time set of Lebesgue measure zero, the control dynamics will not be influenced. Therefore, to characterize open-loop Nash equilibria, which need not be of feedback form, we follow [9], who suggest the following formal definition inspired by [6] and [7].

For brevity, in the rest of the paper we suppress the argument $(s)$ for the coefficients $A(s)$, $B(s)$, $b(s)$, $C_j(s)$, $D_j(s)$, $\sigma_j(s)$, and we use the notation $\varrho(z)$ instead of $\varrho(s,z)$ for $\varrho = E, F$ and $c$. In addition, we sometimes simply call $\hat u(\cdot)$ an equilibrium control instead of an open-loop Nash equilibrium control when there is no ambiguity.

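We define an equilibrium by local spike variation. Given an admissible control $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$, for any $t\in[0,T]$, $v\in\mathbb{L}^4(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$ and any $\varepsilon > 0$, define

$$
u^\varepsilon(s) =
\begin{cases}
\hat u(s) + v, & \text{for } s\in[t,t+\varepsilon), \\
\hat u(s), & \text{for } s\in[t+\varepsilon,T].
\end{cases}
\tag{3.1}
$$

We have the following definition.

Definition 3.1 (Open-loop Nash equilibrium). An admissible strategy $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ is an open-loop Nash equilibrium control for Problem (LQJ) if

$$
\lim_{\varepsilon\downarrow 0}\frac{1}{\varepsilon}\Big\{J\big(t,\hat X(t),u^\varepsilon(\cdot)\big) - J\big(t,\hat X(t),\hat u(\cdot)\big)\Big\} \ge 0,
\tag{3.2}
$$

for any $t\in[0,T]$ and $v\in\mathbb{L}^4(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$. The corresponding equilibrium dynamics solves the following SDE with jumps:

$$
\begin{cases}
d\hat X(s) = \{A\hat X(s) + B\hat u(s) + b\}\,ds + \sum_{j=1}^{d}\{C_j\hat X(s) + D_j\hat u(s) + \sigma_j\}\,dW^j(s) \\
\qquad\qquad + \int_Z \{E(z)\hat X(s-) + F(z)\hat u(s) + c(z)\}\,\tilde N(ds,dz), \quad s\in[0,T], \\
\hat X_0 = x_0.
\end{cases}
$$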
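
The spike variation (3.1) is simple enough to state as code. The sketch below builds the perturbed control $u^\varepsilon$ from a candidate control; it is a deterministic toy version in which `v` is a plain number, whereas in the definition above $v$ is an $\mathcal{F}_t$-measurable random variable. The names `spike_variation` and `u_hat` and the sample control are hypothetical.

```python
def spike_variation(u_hat, t, eps, v):
    """Return u_eps with u_eps(s) = u_hat(s) + v on [t, t+eps), u_hat(s) elsewhere."""
    def u_eps(s):
        return u_hat(s) + v if t <= s < t + eps else u_hat(s)
    return u_eps

u_hat = lambda s: 1.0 - s          # hypothetical candidate control
u_eps = spike_variation(u_hat, t=0.25, eps=0.1, v=2.0)

# Unperturbed before t, shifted by v inside the spike, unperturbed afterwards
print(u_eps(0.2), u_eps(0.3), u_eps(0.5))
```

Only the values on the short window $[t,t+\varepsilon)$ change, which is why the cost difference in (3.2) is divided by $\varepsilon$ before passing to the limit.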


3.1 The flow of adjoint equations 

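First, we introduce the adjoint equations involved in the stochastic maximum principle which characterizes the open-loop Nash equilibrium controls of Problem (LQJ). Define the Hamiltonian $\mathbb{H}: D[0,T]\times\mathbb{L}^1(\Omega,\mathcal{F}_T,\mathbb{P};\mathbb{R}^n)\times\mathbb{R}^m\times\mathbb{R}^n\times\mathbb{R}^{n\times d}\times\mathbb{L}^2(Z,\mathcal{B}(Z),\theta;\mathbb{R}^n)\to\mathbb{R}$ by

$$
\begin{aligned}
\mathbb{H}(t,s,X,u,p,q,r(\cdot)) = {} & \langle p, AX + Bu + b\rangle + \sum_{j=1}^{d}\langle q_j, C_jX + D_ju + \sigma_j\rangle - \tfrac12\langle R(t,s)u,u\rangle \\
& + \int_Z \langle r(z), E(z)X + F(z)u + c(z)\rangle\,\theta(dz) - \tfrac12\big(\langle Q(t,s)X,X\rangle + \langle \bar Q(t,s)\mathbb{E}^t[X],\mathbb{E}^t[X]\rangle\big).
\end{aligned}
\tag{3.3}
$$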


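Let $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ and denote by $\hat X(\cdot)$ the corresponding controlled state process. For each $t\in[0,T]$, we introduce the first-order adjoint equation defined on the time interval $[t,T]$ and satisfied by the triple of processes $(p(\cdot;t),q(\cdot;t),r(\cdot,\cdot;t))$ as follows:

$$
\begin{cases}
dp(s;t) = -\Big\{A^\top p(s;t) + \sum_{j=1}^{d} C_j^\top q_j(s;t) + \int_Z E(z)^\top r(s,z;t)\,\theta(dz) - Q(t,s)\hat X(s) \\
\qquad\qquad - \bar Q(t,s)\mathbb{E}^t[\hat X(s)]\Big\}\,ds + \sum_{j=1}^{d} q_j(s;t)\,dW^j(s) + \int_Z r(s-,z;t)\,\tilde N(ds,dz), \quad s\in[t,T], \\
p(T;t) = -G(t)\hat X(T) - \bar G(t)\mathbb{E}^t[\hat X(T)] - \mu_1(t)\hat X(t) - \mu_2(t),
\end{cases}
\tag{3.4}
$$

where $q(\cdot;t) = (q_1(\cdot;t),\dots,q_d(\cdot;t))$. Similarly, we introduce the second-order adjoint equation defined on the time interval $[t,T]$ and satisfied by the triple of processes $(P(\cdot;t),\Lambda(\cdot;t),\Gamma(\cdot,\cdot;t))$ as follows:

$$
\begin{cases}
dP(s;t) = -\Big\{A^\top P(s;t) + P(s;t)A + \sum_{j=1}^{d}\big(C_j^\top P(s;t)C_j + \Lambda_j(s;t)C_j + C_j^\top\Lambda_j(s;t)\big) \\
\qquad\qquad + \int_Z E(z)^\top\big(\Gamma(s,z;t) + P(s;t)\big)E(z)\,\theta(dz) \\
\qquad\qquad + \int_Z \Gamma(s,z;t)E(z)\,\theta(dz) + \int_Z E(z)^\top\Gamma(s,z;t)\,\theta(dz) - Q(t,s)\Big\}\,ds \\
\qquad\qquad + \sum_{j=1}^{d}\Lambda_j(s;t)\,dW^j(s) + \int_Z \Gamma(s-,z;t)\,\tilde N(ds,dz), \quad s\in[t,T], \\
P(T;t) = -G(t),
\end{cases}
\tag{3.5}
$$

where $\Lambda(\cdot;t) = (\Lambda_1(\cdot;t),\dots,\Lambda_d(\cdot;t))$. Under (H1) the BSDE (3.4) is uniquely solvable in $\mathcal{S}^2_{\mathcal{F}}(t,T;\mathbb{R}^n)\times\mathcal{L}^2_{\mathcal{F}}(t,T;\mathbb{R}^{n\times d})\times\mathcal{L}^{\theta,2}_{\mathcal{F}}([t,T]\times Z;\mathbb{R}^n)$, see e.g. [11]. Moreover, there exists a constant $K > 0$ such that

$$
\mathbb{E}\Big[\sup_{t\le s\le T}|p(s;t)|^2_{\mathbb{R}^n}\Big] + \mathbb{E}\Big[\int_t^T |q(s;t)|^2_{\mathbb{R}^{n\times d}}\,ds\Big] + \mathbb{E}\Big[\int_t^T\!\!\int_Z |r(s,z;t)|^2_{\mathbb{R}^n}\,\theta(dz)\,ds\Big] \le K.
\tag{3.6}
$$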


On the other hand, noting that the terminal data of equation (3.5) is deterministic, it is natural to look for a deterministic solution. In this case, we have the following representation:

$$
\begin{cases}
dP(s;t) = -\Big\{A^\top P(s;t) + P(s;t)A + \sum_{j=1}^{d} C_j^\top P(s;t)C_j \\
\qquad\qquad + \int_Z E(z)^\top P(s;t)E(z)\,\theta(dz) - Q(t,s)\Big\}\,ds, \quad s\in[t,T], \\
P(T;t) = -G(t),
\end{cases}
\tag{3.7}
$$

which is a uniquely solvable matrix-valued ordinary differential equation. Indeed, if we define the function $\Phi(s,\cdot)$, for each $s\in[0,T]$, as the fundamental solution of the following linear SDE

$$
\begin{cases}
d\Phi(s,r) = A(r)\Phi(s,r)\,dr + \sum_{j=1}^{d} C_j(r)\Phi(s,r)\,dW^j(r) + \int_Z E(r,z)\Phi(s,r-)\,\tilde N(dr,dz), \quad r\in[s,T], \\
\Phi(s,s) = I,
\end{cases}
\tag{3.8}
$$

then, by standard arguments based on Itô's formula, we can prove that the triple $(P(\cdot;t),\Lambda(\cdot;t),\Gamma(\cdot,\cdot;t))$ solving (3.5), and hence (3.7), is explicitly given by

$$
\begin{cases}
P(s;t) = \mathbb{E}^s\Big[-\Phi(s,T)^\top G(t)\Phi(s,T) - \int_s^T \Phi(s,r)^\top Q(t,r)\Phi(s,r)\,dr\Big], \quad s\in[t,T], \\
\Lambda_j(s;t) = 0, \quad s\in[t,T], \text{ for } j = 1,2,\dots,d, \\
\Gamma(s,z;t) = 0, \quad (s,z)\in[t,T]\times Z.
\end{cases}
\tag{3.9}
$$
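
To see (3.7) at work, the following sketch integrates its scalar version backward in time with a classical Runge-Kutta scheme and compares the result with the closed form available when $Q$ is constant. The scalar reduction $\dot P = -(2a + c^2 + e_2)P + Q(t,s)$ with $e_2 = \int_Z E(z)^2\,\theta(dz)$, the coefficient values, the hyperbolic running weight, and the names `solve_P` and `closed_form_P` are all assumptions for illustration.

```python
import math

def solve_P(t, T=1.0, steps=200, a=0.3, c=0.2, e2=0.1,
            Q=lambda t, s: 1.0 / (1.0 + (s - t)),
            G=lambda t: 1.0):
    """Integrate the scalar version of (3.7) backward from s = T to s = t.

    Scalar form: dP/ds = -(2a + c^2 + e2) P + Q(t, s),  P(T; t) = -G(t),
    where e2 stands for the integral of E(z)^2 against theta (assumed constant).
    Returns P(t; t).
    """
    kappa = 2.0 * a + c * c + e2
    f = lambda s, P: -kappa * P + Q(t, s)
    hstep = (t - T) / steps          # negative step: integrate backward
    s, P = T, -G(t)
    for _ in range(steps):
        k1 = f(s, P)
        k2 = f(s + hstep / 2, P + hstep * k1 / 2)
        k3 = f(s + hstep / 2, P + hstep * k2 / 2)
        k4 = f(s + hstep, P + hstep * k3)
        P += hstep * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        s += hstep
    return P

def closed_form_P(t, T, kappa, Q0, G0):
    """P(t; t) from (3.9) when Q = Q0 and G = G0 are constants (scalar case)."""
    growth = math.exp(kappa * (T - t))
    return -G0 * growth - Q0 * (growth - 1.0) / kappa

print(solve_P(0.0), solve_P(0.5))
```

With nonnegative $Q$ and $G$, the computed values of $P(t;t)$ are negative, in line with the sign structure of (3.9).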


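Next, for each $t\in[0,T]$, associated with the 6-tuple $(\hat u(\cdot),\hat X(\cdot),p(\cdot;t),q(\cdot;t),r(\cdot,\cdot;t),P(\cdot;t))$ we define the $\mathcal{H}_t$-function as follows:

$$
\begin{aligned}
\mathcal{H}_t(s,X,u) = {} & \mathbb{H}\big(t,s,X,\hat u(s) + u,p(s;t),q(s;t),r(s,\cdot;t)\big) + \frac12\sum_{j=1}^{d} u^\top D_j^\top P(s;t)D_ju \\
& + \frac12\int_Z u^\top F(z)^\top P(s;t)F(z)u\,\theta(dz),
\end{aligned}
\tag{3.10}
$$

where $(s,X,u)\in[t,T]\times\mathbb{L}^1(\Omega,\mathcal{F}_T,\mathbb{P};\mathbb{R}^n)\times\mathbb{R}^m$.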













3.2 A stochastic maximum principle for equilibrium controls 

In this section, we present a version of Pontryagin's stochastic maximum principle which characterizes the equilibrium controls of Problem (LQJ). We derive the result by using a second-order Taylor expansion in the special form of the spike variation (3.1). Here, we do not assume the non-negativity condition on the matrices $Q$, $G$ and $R$ as in [9] and [20].

The following theorem is the first main result of this work; it provides a necessary and sufficient condition characterizing the open-loop Nash equilibrium controls for the time-inconsistent Problem (LQJ).

Theorem 3.2 (Stochastic Maximum Principle for Equilibriums). Let (H1) hold. Then an admissible control $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ is an open-loop Nash equilibrium if and only if, for any $t\in[0,T]$, there exist a unique triple of adapted processes $(p(\cdot;t),q(\cdot;t),r(\cdot,\cdot;t))$ which satisfies the BSDE (3.4) and a deterministic matrix-valued function $P(\cdot;t)$ which satisfies the ODE (3.7), such that the following condition holds for all $u\in\mathbb{R}^m$:

$$
\begin{aligned}
& \mathbb{H}\big(t,t,\hat X(t),\hat u(t) + u,p(t;t),q(t;t),r(t,\cdot;t)\big) - \mathbb{H}\big(t,t,\hat X(t),\hat u(t),p(t;t),q(t;t),r(t,\cdot;t)\big) \\
& \quad + \frac12\sum_{j=1}^{d} u^\top D_j(t)^\top P(t;t)D_j(t)u + \frac12\int_Z u^\top F(t,z)^\top P(t;t)F(t,z)u\,\theta(dz) \le 0, \quad \mathbb{P}\text{-a.s.},
\end{aligned}
\tag{3.11}
$$

or equivalently, we have the following two conditions: the first-order equilibrium condition

$$
R(t,t)\hat u(t) - B(t)^\top p(t;t) - \sum_{j=1}^{d} D_j(t)^\top q_j(t;t) - \int_Z F(t,z)^\top r(t,z;t)\,\theta(dz) = 0, \quad \mathbb{P}\text{-a.s.},
\tag{3.12}
$$

and the second-order equilibrium condition

$$
R(t,t) - \sum_{j=1}^{d} D_j(t)^\top P(t;t)D_j(t) - \int_Z F(t,z)^\top P(t;t)F(t,z)\,\theta(dz) \ge 0.
\tag{3.13}
$$


We point out that the above result provides a characterization of open-loop Nash equilibrium controls via a stochastic maximum principle which is not in the same setting as the classical stochastic maximum principle for optimal controls [22]. Indeed, the above result involves the existence of solutions $(\hat X(\cdot),(p(\cdot;t),q(\cdot;t),r(\cdot,\cdot;t))_{t\in[0,T]})$ to a "flow" of forward-backward stochastic differential equations parameterized by $t\in[0,T]$, while Pontryagin's stochastic maximum principle for optimal controls involves only one system of forward-backward stochastic differential equations. Note that for each $t\in[0,T]$, (3.4) and (3.5) are backward stochastic differential equations. So, as we consider all $t$ in $[0,T]$, their corresponding adjoint equations essentially form a "flow" of BSDEs. Moreover, there is an additional constraint (3.11), equivalent to the conditions (3.12) and (3.13), that acts on the flow only when $s = t$.

Our goal now is to give a proof of Theorem 3.2. The main idea is still based on variational techniques, in the same spirit as the proof of Pontryagin's stochastic maximum principle [22].

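Let $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ be an admissible control and $\hat X(\cdot)$ the corresponding controlled process, solution to the state equation. Consider the perturbed control $u^\varepsilon(\cdot)$ defined by the spike variation (3.1) for some fixed arbitrary $t\in[0,T]$, $v\in\mathbb{L}^4(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$ and $\varepsilon\in[0,T-t]$. Denote by $\hat X^\varepsilon(\cdot)$ the solution of the state equation corresponding to $u^\varepsilon(\cdot)$. Since the coefficients of the controlled state equation are linear, by the standard perturbation approach, see e.g. [22], we have

$$
\hat X^\varepsilon(s) - \hat X(s) = y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s), \quad s\in[t,T],
\tag{3.14}
$$

where $y^{\varepsilon,v}(\cdot)$ and $z^{\varepsilon,v}(\cdot)$ solve the following linear stochastic differential equations, respectively:

$$
\begin{cases}
dy^{\varepsilon,v}(s) = Ay^{\varepsilon,v}(s)\,ds + \sum_{j=1}^{d}\{C_jy^{\varepsilon,v}(s) + D_jv\,1_{[t,t+\varepsilon)}(s)\}\,dW^j(s) \\
\qquad\qquad + \int_Z \{E(z)y^{\varepsilon,v}(s-) + F(z)v\,1_{[t,t+\varepsilon)}(s)\}\,\tilde N(ds,dz), \quad s\in[t,T], \\
y^{\varepsilon,v}(t) = 0,
\end{cases}
\tag{3.15}
$$

and

$$
\begin{cases}
dz^{\varepsilon,v}(s) = \{Az^{\varepsilon,v}(s) + Bv\,1_{[t,t+\varepsilon)}(s)\}\,ds + \sum_{j=1}^{d} C_jz^{\varepsilon,v}(s)\,dW^j(s) \\
\qquad\qquad + \int_Z E(z)z^{\varepsilon,v}(s-)\,\tilde N(ds,dz), \quad s\in[t,T], \\
z^{\varepsilon,v}(t) = 0.
\end{cases}
\tag{3.16}
$$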


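Now, we present the following technical lemma needed later in this study; see Appendix A.1 for its proof.

Lemma 3.3. Under assumption (H1), the following estimates hold:

$$
\mathbb{E}^t\big[y^{\varepsilon,v}(s)\big] = 0, \text{ a.e. } s\in[t,T], \quad\text{and}\quad \sup_{s\in[t,T]}\big|\mathbb{E}^t[z^{\varepsilon,v}(s)]\big|^2 = O(\varepsilon^2),
\tag{3.17}
$$

$$
\mathbb{E}^t\Big[\sup_{s\in[t,T]}|y^{\varepsilon,v}(s)|^2\Big] = O(\varepsilon) \quad\text{and}\quad \mathbb{E}^t\Big[\sup_{s\in[t,T]}|z^{\varepsilon,v}(s)|^2\Big] = O(\varepsilon^2).
\tag{3.18}
$$

Moreover, we have the equality

$$
\begin{aligned}
& J\big(t,\hat X(t),u^\varepsilon(\cdot)\big) - J\big(t,\hat X(t),\hat u(\cdot)\big) \\
& = -\mathbb{E}^t\Big[\int_t^T \Big\{\mathbb{H}\big(t,s,\hat X(s),\hat u(s) + v,p(s;t),q(s;t),r(s,\cdot;t)\big) - \mathbb{H}\big(t,s,\hat X(s),\hat u(s),p(s;t),q(s;t),r(s,\cdot;t)\big) \\
& \qquad + \frac12\Big(\sum_{j=1}^{d} v^\top D_j^\top P(s;t)D_jv + \int_Z v^\top F(z)^\top P(s;t)F(z)v\,\theta(dz)\Big)\Big\}1_{[t,t+\varepsilon)}(s)\,ds\Big] + o(\varepsilon).
\end{aligned}
\tag{3.19}
$$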


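Now we are ready to give the proof of Theorem 3.2.

Proof of Theorem 3.2. Given an open-loop Nash equilibrium $\hat u(\cdot)$, for any $t\in[0,T]$ and $v\in\mathbb{L}^4(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$ we clearly have

$$
\lim_{\varepsilon\downarrow 0}\frac{1}{\varepsilon}\Big\{J\big(t,\hat X(t),\hat u(\cdot)\big) - J\big(t,\hat X(t),u^\varepsilon(\cdot)\big)\Big\} \le 0,
$$

from which we deduce, by (3.19),

$$
\begin{aligned}
\lim_{\varepsilon\downarrow 0}\frac{1}{\varepsilon}\,\mathbb{E}^t\Big[\int_t^T \Big\{ & \mathbb{H}\big(t,s,\hat X(s),\hat u(s) + v,p(s;t),q(s;t),r(s,\cdot;t)\big) - \mathbb{H}\big(t,s,\hat X(s),\hat u(s),p(s;t),q(s;t),r(s,\cdot;t)\big) \\
& + \frac12\sum_{j=1}^{d} v^\top D_j^\top P(s;t)D_jv + \frac12\int_Z v^\top F(z)^\top P(s;t)F(z)v\,\theta(dz)\Big\}1_{[t,t+\varepsilon)}(s)\,ds\Big] \le 0,
\end{aligned}
$$

which leads to

$$
\begin{aligned}
& \mathbb{H}\big(t,t,\hat X(t),\hat u(t) + v,p(t;t),q(t;t),r(t,\cdot;t)\big) - \mathbb{H}\big(t,t,\hat X(t),\hat u(t),p(t;t),q(t;t),r(t,\cdot;t)\big) \\
& \quad + \frac12\sum_{j=1}^{d} v^\top D_j(t)^\top P(t;t)D_j(t)v + \frac12\int_Z v^\top F(t,z)^\top P(t;t)F(t,z)v\,\theta(dz) \le 0, \quad \mathbb{P}\text{-a.s.}
\end{aligned}
$$

Therefore, by setting $v \equiv u$ for an arbitrary $u\in\mathbb{R}^m$, we obtain (3.11).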

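Conversely, let $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ be an admissible control, and suppose that for any $t\in[0,T]$ the variational inequality (3.11) holds. Then for any $v\in\mathbb{L}^4(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$ it yields

$$
\begin{aligned}
& \mathbb{H}\big(t,t,\hat X(t),\hat u(t) + v,p(t;t),q(t;t),r(t,\cdot;t)\big) - \mathbb{H}\big(t,t,\hat X(t),\hat u(t),p(t;t),q(t;t),r(t,\cdot;t)\big) \\
& \quad + \frac12\sum_{j=1}^{d} v^\top D_j(t)^\top P(t;t)D_j(t)v + \frac12\int_Z v^\top F(t,z)^\top P(t;t)F(t,z)v\,\theta(dz) \le 0, \quad \mathbb{P}\text{-a.s.},
\end{aligned}
$$

consequently

$$
\begin{aligned}
\lim_{\varepsilon\downarrow 0}\frac{1}{\varepsilon}\,\mathbb{E}^t\Big[\int_t^T \Big\{ & \mathbb{H}\big(t,s,\hat X(s),\hat u(s) + v,p(s;t),q(s;t),r(s,\cdot;t)\big) - \mathbb{H}\big(t,s,\hat X(s),\hat u(s),p(s;t),q(s;t),r(s,\cdot;t)\big) \\
& + \frac12\sum_{j=1}^{d} v^\top D_j^\top P(s;t)D_jv + \frac12\int_Z v^\top F(z)^\top P(s;t)F(z)v\,\theta(dz)\Big\}1_{[t,t+\varepsilon)}(s)\,ds\Big] \le 0.
\end{aligned}
$$

Hence

$$
\lim_{\varepsilon\downarrow 0}\frac{1}{\varepsilon}\Big\{J\big(t,\hat X(t),\hat u(\cdot)\big) - J\big(t,\hat X(t),u^\varepsilon(\cdot)\big)\Big\} \le 0,
$$

thus $\hat u(\cdot)$ is an equilibrium control.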

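Easy manipulations show that the variational inequality (3.11) is equivalent to

$$
\mathcal{H}_t\big(t,\hat X(t),0\big) = \max_{u\in\mathbb{R}^m}\mathcal{H}_t\big(t,\hat X(t),u\big);
$$

then (3.12) and (3.13) follow respectively from the first-order and second-order conditions at the maximum point $u = 0$ of the quadratic function $u\mapsto\mathcal{H}_t(t,\hat X(t),u)$:

$$
\frac{\partial\mathcal{H}_t}{\partial u}\big(t,\hat X(t),0\big) = 0 \quad\text{and}\quad \frac{\partial^2\mathcal{H}_t}{\partial u^2}\big(t,\hat X(t),u\big) \le 0.
$$

Then the required result follows directly. ■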

In Theorem 3.2, in view of condition (3.11), as long as the term

$$
-\Big(\sum_{j=1}^{d} D_j(t)^\top P(t;t)D_j(t) + \int_Z F(t,z)^\top P(t;t)F(t,z)\,\theta(dz)\Big),
$$

for each $t\in[0,T]$, is sufficiently positive definite, the necessary and sufficient condition for equilibria might still be satisfied even if $R(t,t)$ is negative. This is different from [9] and [20], where the authors assume the non-negativity of the matrices $Q$, $G$ and $R$ in order to state their stochastic maximum principle for open-loop Nash equilibria. Moreover, in view of (3.9), in the case where $Q(t,s)\ge 0$ for every $s\in[t,T]$ and $G(t)\ge 0$, it follows that the solution of the second-order adjoint equation satisfies $P(t;t)\le 0$. If we further have $R(t,t)\ge 0$, then the condition

$$
R(t,t) - \sum_{j=1}^{d} D_j(t)^\top P(t;t)D_j(t) - \int_Z F(t,z)^\top P(t;t)F(t,z)\,\theta(dz) \ge 0
$$

is obviously satisfied. Therefore, we summarize the main theorem in the following corollary.

Corollary 3.4. Let (H1)-(H2) hold. Then an admissible control $\hat u(\cdot)\in\mathcal{L}^2_{\mathcal{F}}(0,T;\mathbb{R}^m)$ is an equilibrium control if and only if, for any $t\in[0,T]$, there exists a triple of adapted processes $(p(\cdot;t),q(\cdot;t),r(\cdot,\cdot;t))$ which satisfies the BSDE (3.4) and for which the first-order condition (3.12) holds.


4 Linear feedback stochastic equilibrium control 

In this section, we consider only the case where the Brownian motion is one-dimensional ($d = 1$), for simplicity of presentation; the multidimensional case presents no essential additional difficulty. All the indices $j$ are therefore dropped. Our goal is to obtain a state feedback representation of an equilibrium control for Problem (LQJ) via some class of ordinary differential equations. Suppose that $\hat u(\cdot)$ is an equilibrium control and denote by $\hat X(\cdot)$ the corresponding controlled process. Then, in view of Theorem 3.2, there exists a flow of triples of adapted processes $(p(\cdot;t), q(\cdot;t), r(\cdot,\cdot;t))_{t\in[0,T]}$ for which the 3-tuple $\big(\hat u(\cdot), \hat X(\cdot), (p(\cdot;t), q(\cdot;t), r(\cdot,\cdot;t))_{t\in[0,T]}\big)$ solves the following flow of forward-backward SDEs with jumps, parametrized by $t \in [0,T]$:

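\[
\left\{
\begin{aligned}
d\hat X(s) &= \{A\hat X(s) + B\hat u(s) + b\}\,ds + \{C\hat X(s) + D\hat u(s) + \sigma\}\,dW(s) \\
&\quad + \int_Z \{E(z)\hat X(s-) + F(z)\hat u(s) + c(z)\}\,\tilde N(ds,dz), \quad s \in [0,T], \\
dp(s;t) &= -\Big\{A^{\top} p(s;t) + C^{\top} q(s;t) + \int_Z E(z)^{\top} r(s,z;t)\,\theta(dz) - Q(t,s)\hat X(s) \\
&\quad - \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)]\Big\}\,ds + q(s;t)\,dW(s) + \int_Z r(s-,z;t)\,\tilde N(ds,dz), \quad 0 \le t \le s \le T, \\
\hat X_0 &= x_0, \quad p(T;t) = -G(t)\hat X(T) - \bar G(t)\,\mathbb{E}^t[\hat X(T)] - \mu_1(t)\hat X(t) - \mu_2(t), \quad t \in [0,T],
\end{aligned}
\right. \tag{4.1}
\]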

with the condition

\[
R(t,t)\hat u(t) - B^{\top} p(t;t) - D(t)^{\top} q(t;t) - \int_Z F(t,z)^{\top} r(t,z;t)\,\theta(dz) = 0, \quad \mathbb{P}\text{-a.s.}, \ \forall t \in [0,T]. \tag{4.2}
\]

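Now, to solve the above stochastic system, we conjecture that $\hat X(\cdot)$ and $p(\cdot;t)$, for $t \in [0,T]$, are related by

\[
p(s;t) = -M(t,s)\hat X(s) - \bar M(t,s)\,\mathbb{E}^t[\hat X(s)] - \Upsilon(t,s)\hat X(t) - \varphi(t,s), \quad (t,s) \in \mathcal{D}[0,T], \tag{4.3}
\]

for some deterministic functions $M(\cdot,\cdot)$, $\bar M(\cdot,\cdot)$, $\Upsilon(\cdot,\cdot)$ defined on $\mathcal{D}[0,T]$ with values in $\mathbb{R}^{n\times n}$, and $\varphi(\cdot,\cdot)$ defined on $\mathcal{D}[0,T]$ with values in $\mathbb{R}^{n}$, such that

\[
M(t,T) = G(t), \quad \bar M(t,T) = \bar G(t), \quad \Upsilon(t,T) = \mu_1(t), \quad \varphi(t,T) = \mu_2(t), \quad t \in [0,T]. \tag{4.4}
\]

Applying Itô's formula to (4.3) and using (4.1) yields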

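\[
\begin{aligned}
dp(s;t) &= -\Big\{\frac{\partial M}{\partial s}(t,s)\hat X(s) + \frac{\partial \bar M}{\partial s}(t,s)\,\mathbb{E}^t[\hat X(s)] + \frac{\partial \Upsilon}{\partial s}(t,s)\hat X(t) + \frac{\partial \varphi}{\partial s}(t,s) \\
&\qquad + M(t,s)\big(A\hat X(s) + B\hat u(s) + b\big) + \bar M(t,s)\big(A\,\mathbb{E}^t[\hat X(s)] + B\,\mathbb{E}^t[\hat u(s)] + b\big)\Big\}\,ds \\
&\quad - M(t,s)\big(C\hat X(s) + D\hat u(s) + \sigma\big)\,dW(s) \\
&\quad - \int_Z M(t,s)\big(E(z)\hat X(s-) + F(z)\hat u(s) + c(z)\big)\,\tilde N(ds,dz) \\
&= -\Big\{A^{\top} p(s;t) + C^{\top} q(s;t) + \int_Z E(z)^{\top} r(s,z;t)\,\theta(dz) - Q(t,s)\hat X(s) \\
&\qquad - \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)]\Big\}\,ds + q(s;t)\,dW(s) + \int_Z r(s-,z;t)\,\tilde N(ds,dz), \quad s \in [t,T],
\end{aligned} \tag{4.5}
\]

and we obtain

\[
q(s;t) = -M(t,s)\big(C\hat X(s) + D\hat u(s) + \sigma\big), \tag{4.6}
\]

\[
r(s,z;t) = -M(t,s)\big(E(z)\hat X(s) + F(z)\hat u(s) + c(z)\big). \tag{4.7}
\]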
Substituting (4.3), (4.6) and (4.7) into (4.2), and using $\mathbb{E}^t[\hat X(t)] = \hat X(t)$, we get

\[
\begin{aligned}
0 &= R(t,t)\hat u(t) + B(t)^{\top}\big\{(M(t,t) + \bar M(t,t) + \Upsilon(t,t))\hat X(t) + \varphi(t,t)\big\} \\
&\quad + D(t)^{\top} M(t,t)\big(C(t)\hat X(t) + D(t)\hat u(t) + \sigma(t)\big) \\
&\quad + \int_Z F(t,z)^{\top} M(t,t)\big(E(t,z)\hat X(t) + F(t,z)\hat u(t) + c(t,z)\big)\,\theta(dz) \\
&= \Big(R(t,t) + D(t)^{\top} M(t,t) D(t) + \int_Z F(t,z)^{\top} M(t,t) F(t,z)\,\theta(dz)\Big)\hat u(t) \\
&\quad + \Big(B(t)^{\top}(M(t,t) + \bar M(t,t) + \Upsilon(t,t)) + D(t)^{\top} M(t,t) C(t) + \int_Z F(t,z)^{\top} M(t,t) E(t,z)\,\theta(dz)\Big)\hat X(t) \\
&\quad + B(t)^{\top}\varphi(t,t) + D(t)^{\top} M(t,t)\sigma(t) + \int_Z F(t,z)^{\top} M(t,t) c(t,z)\,\theta(dz).
\end{aligned}
\]

Thus, if we assume that

\[
\Theta(t) = \Big(R(t,t) + D(t)^{\top} M(t,t) D(t) + \int_Z F(t,z)^{\top} M(t,t) F(t,z)\,\theta(dz)\Big)^{-1}
\]

exists, then we deduce that $\hat u(\cdot)$ admits the following feedback representation

\[
\hat u(t) = -\Psi(t)\hat X(t) - \psi(t), \tag{4.8}
\]

where $\Psi(t)$ and $\psi(t)$ are given by

\[
\left\{
\begin{aligned}
\Psi(t) &= \Theta(t)\Big(B(t)^{\top}(M(t,t) + \bar M(t,t) + \Upsilon(t,t)) + D(t)^{\top} M(t,t) C(t) + \int_Z F(t,z)^{\top} M(t,t) E(t,z)\,\theta(dz)\Big), \\
\psi(t) &= \Theta(t)\Big(B(t)^{\top}\varphi(t,t) + D(t)^{\top} M(t,t)\sigma(t) + \int_Z F(t,z)^{\top} M(t,t) c(t,z)\,\theta(dz)\Big).
\end{aligned}
\right. \tag{4.9}
\]

Therefore, for any $(t,s) \in \mathcal{D}[0,T]$, we have

\[
\mathbb{E}^t[\hat u(s)] = -\Psi(s)\,\mathbb{E}^t[\hat X(s)] - \psi(s). \tag{4.10}
\]
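
The algebra leading from the first-order condition (4.2) to the feedback form (4.8)-(4.9) can be sanity-checked numerically at a single time $t$. The following is a minimal sketch, not the authors' implementation: the function names are hypothetical, all matrices are random placeholders, and the jump measure is assumed to be a finite sum of weighted point masses so that each integral over $z$ becomes a weighted sum.

```python
import numpy as np

def feedback_gains(R, B, C, D, sigma, M, M_bar, Ups, phi, jumps):
    """Return (Theta, Psi, psi) as in (4.9), evaluated at one fixed time t.

    jumps is a list of (w_k, E_k, F_k, c_k); assumes theta = sum_k w_k * delta_{z_k}.
    """
    FMF = sum(w * (F.T @ M @ F) for w, E, F, c in jumps)
    FME = sum(w * (F.T @ M @ E) for w, E, F, c in jumps)
    FMc = sum(w * (F.T @ M @ c) for w, E, F, c in jumps)
    Theta = np.linalg.inv(R + D.T @ M @ D + FMF)
    Psi = Theta @ (B.T @ (M + M_bar + Ups) + D.T @ M @ C + FME)
    psi = Theta @ (B.T @ phi + D.T @ M @ sigma + FMc)
    return Theta, Psi, psi

def first_order_residual(u, x, R, B, C, D, sigma, M, M_bar, Ups, phi, jumps):
    """Right-hand side of the expanded condition (4.2) for a given control u and state x."""
    res = R @ u + B.T @ ((M + M_bar + Ups) @ x + phi)
    res = res + D.T @ M @ (C @ x + D @ u + sigma)
    for w, E, F, c in jumps:
        res = res + w * (F.T @ M @ (E @ x + F @ u + c))
    return res

# Random placeholder data (n states, m controls); M = I keeps Theta well defined here.
rng = np.random.default_rng(0)
n, m = 3, 2
R = np.eye(m)
B, D = rng.normal(size=(n, m)), rng.normal(size=(n, m))
C = rng.normal(size=(n, n))
sigma, phi = rng.normal(size=n), rng.normal(size=n)
M = np.eye(n)
M_bar, Ups = 0.1 * rng.normal(size=(n, n)), 0.1 * rng.normal(size=(n, n))
jumps = [(0.7, rng.normal(size=(n, n)), rng.normal(size=(n, m)), rng.normal(size=n))]

Theta, Psi, psi = feedback_gains(R, B, C, D, sigma, M, M_bar, Ups, phi, jumps)
x = rng.normal(size=n)
u = -Psi @ x - psi  # feedback form (4.8)
residual = first_order_residual(u, x, R, B, C, D, sigma, M, M_bar, Ups, phi, jumps)
```

With the control chosen as in (4.8), `residual` should vanish up to floating-point error, which is exactly the statement that (4.8) solves the expanded condition (4.2).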

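Next, comparing the $ds$ term in (4.5) with the one in (4.1), and using the expressions (4.8) and (4.10), we obtain

\[
\begin{aligned}
0 &= \Big\{\frac{\partial M}{\partial s}(t,s) + M(t,s)A + A^{\top}M(t,s) + C^{\top}M(t,s)C + \int_Z E(z)^{\top}M(t,s)E(z)\,\theta(dz) \\
&\qquad - \Big(M(t,s)B + C^{\top}M(t,s)D + \int_Z E(z)^{\top}M(t,s)F(z)\,\theta(dz)\Big)\Psi(s) + Q(t,s)\Big\}\hat X(s) \\
&\quad + \Big\{\frac{\partial \bar M}{\partial s}(t,s) + \bar M(t,s)A + A^{\top}\bar M(t,s) - \bar M(t,s)B\Psi(s) + \bar Q(t,s)\Big\}\,\mathbb{E}^t[\hat X(s)] \\
&\quad + \Big\{\frac{\partial \Upsilon}{\partial s}(t,s) + A^{\top}\Upsilon(t,s)\Big\}\hat X(t) \\
&\quad + \frac{\partial \varphi}{\partial s}(t,s) + (M(t,s) + \bar M(t,s))(b - B\psi(s)) + A^{\top}\varphi(t,s) \\
&\quad + C^{\top}M(t,s)(\sigma - D\psi(s)) + \int_Z E(z)^{\top}M(t,s)\big(c(z) - F(z)\psi(s)\big)\,\theta(dz).
\end{aligned}
\]

This suggests that the functions $M(\cdot,\cdot)$, $\bar M(\cdot,\cdot)$, $\Upsilon(\cdot,\cdot)$ and $\varphi(\cdot,\cdot)$ solve the following system of ordinary differential equations, for $(t,s) \in \mathcal{D}[0,T]$:

\[
\left\{
\begin{aligned}
0 &= \frac{\partial M}{\partial s}(t,s) + M(t,s)A + A^{\top}M(t,s) + C^{\top}M(t,s)C + \int_Z E(z)^{\top}M(t,s)E(z)\,\theta(dz) \\
&\quad - \Big(M(t,s)B + C^{\top}M(t,s)D + \int_Z E(z)^{\top}M(t,s)F(z)\,\theta(dz)\Big)\Psi(s) + Q(t,s), \\
0 &= \frac{\partial \bar M}{\partial s}(t,s) + \bar M(t,s)A + A^{\top}\bar M(t,s) - \bar M(t,s)B\Psi(s) + \bar Q(t,s), \\
0 &= \frac{\partial \Upsilon}{\partial s}(t,s) + A^{\top}\Upsilon(t,s), \\
0 &= \frac{\partial \varphi}{\partial s}(t,s) + (M(t,s) + \bar M(t,s))(b - B\psi(s)) + A^{\top}\varphi(t,s) + C^{\top}M(t,s)(\sigma - D\psi(s)) \\
&\quad + \int_Z E(z)^{\top}M(t,s)\big(c(z) - F(z)\psi(s)\big)\,\theta(dz), \\
& M(t,T) = G(t), \quad \bar M(t,T) = \bar G(t), \quad \Upsilon(t,T) = \mu_1(t), \quad \varphi(t,T) = \mu_2(t), \quad t \in [0,T],
\end{aligned}
\right. \tag{4.11}
\]

where

\[
\det\Big(R(t,t) + D(t)^{\top}M(t,t)D(t) + \int_Z F(t,z)^{\top}M(t,t)F(t,z)\,\theta(dz)\Big) \neq 0, \quad t \in [0,T],
\]

and $\Psi(s)$, $\psi(s)$ are given by (4.9).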
4.1 Verification Theorem 


The following theorem provides a verification argument.

Theorem 4.1. Let (H1)-(H2) hold. Let $M(\cdot,\cdot)$, $\bar M(\cdot,\cdot)$, $\Upsilon(\cdot,\cdot)$ and $\varphi(\cdot,\cdot)$ be the solution of the system (4.11). Then $\hat u(\cdot)$ given by (4.8) is an equilibrium control.

Proof. First, we can check that $\Psi(\cdot)$ and $\psi(\cdot)$ in (4.9) are both uniformly bounded. Then the following linear SDE

\[
\left\{
\begin{aligned}
d\hat X(s) &= \{(A - B\Psi(s))\hat X(s) + b - B\psi(s)\}\,ds + \{(C - D\Psi(s))\hat X(s) + \sigma - D\psi(s)\}\,dW(s) \\
&\quad + \int_Z \{(E(z) - F(z)\Psi(s))\hat X(s-) + c(z) - F(z)\psi(s)\}\,\tilde N(ds,dz), \quad s \in [0,T], \\
\hat X(0) &= x_0,
\end{aligned}
\right.
\]

is uniquely solvable, and the following estimate holds:

\[
\mathbb{E}\Big[\sup_{s \in [0,T]} |\hat X(s)|^2\Big] \le K(1 + |x_0|^2).
\]

So the control $\hat u(\cdot)$ defined by (4.8) is admissible. Moreover, by the definition of $(p(s;t), q(s;t), r(s,z;t))$ via (4.3), (4.6) and (4.7), respectively, and by applying Itô's formula, we can easily check that, for each $t \in [0,T]$, the processes $(p(\cdot;t), q(\cdot;t), r(\cdot,\cdot;t))$ satisfy (3.4).

Finally, in view of Corollary 3.4, it remains to check that the condition (3.11) holds. To this end, we substitute $(p(t;t), q(t;t), r(t,z;t), \hat u(t))$, taken from (4.3), (4.6), (4.7) and (4.8), respectively, into the expression

\[
K(t) = R(t,t)\hat u(t) - B(t)^{\top}p(t;t) - D(t)^{\top}q(t;t) - \int_Z F(t,z)^{\top}r(t,z;t)\,\theta(dz),
\]

and we get

\[
\begin{aligned}
K(t) &= R(t,t)\hat u(t) + B(t)^{\top}\big\{(M(t,t) + \bar M(t,t) + \Upsilon(t,t))\hat X(t) + \varphi(t,t)\big\} \\
&\quad + D(t)^{\top}M(t,t)\big(C(t)\hat X(t) + D(t)\hat u(t) + \sigma(t)\big) \\
&\quad + \int_Z F(t,z)^{\top}M(t,t)\big(E(t,z)\hat X(t) + F(t,z)\hat u(t) + c(t,z)\big)\,\theta(dz) \\
&= \Theta(t)^{-1}\big(\hat u(t) + \Psi(t)\hat X(t) + \psi(t)\big).
\end{aligned}
\]

The representation (4.8) then shows that $K(t) = 0$. Hence, by Corollary 3.4, $\hat u(\cdot)$ is an open-loop Nash equilibrium control. ■
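
To visualize the closed-loop state appearing in the proof, one can discretize the SDE with an Euler scheme. The sketch below is illustrative only and makes several assumptions that are not in the text: the state is scalar, the jump measure is a single point mass of intensity `lam` (so the compensated integral reduces to `jump * (dN - lam * dt)`), and the caller supplies hypothetical callables `drift`, `diffusion` and `jump` encoding, respectively, $(A - B\Psi(s))x + b - B\psi(s)$, $(C - D\Psi(s))x + \sigma - D\psi(s)$ and $(E - F\Psi(s))x + c - F\psi(s)$.

```python
import numpy as np

def simulate_closed_loop(x0, T, n_steps, drift, diffusion, jump, lam, rng):
    """Euler scheme for a scalar jump diffusion driven by W and a compensated Poisson process.

    drift, diffusion, jump : callables (s, x) -> float
    lam                    : jump intensity (assumed single jump mark)
    """
    dt = T / n_steps
    x = np.empty(n_steps + 1)
    x[0] = x0
    for k in range(n_steps):
        s = k * dt
        dW = rng.normal(0.0, np.sqrt(dt))   # Brownian increment
        dN = rng.poisson(lam * dt)          # number of jumps in [s, s + dt)
        x[k + 1] = (x[k]
                    + drift(s, x[k]) * dt
                    + diffusion(s, x[k]) * dW
                    + jump(s, x[k]) * (dN - lam * dt))  # compensated jump term
    return x

# Noise-free sanity check: dx = (1 - x) ds has the exact solution 1 + (x0 - 1) e^{-s}.
path = simulate_closed_loop(
    x0=2.0, T=1.0, n_steps=2000,
    drift=lambda s, x: 1.0 - x,
    diffusion=lambda s, x: 0.0,
    jump=lambda s, x: 0.0,
    lam=0.0,
    rng=np.random.default_rng(1),
)
exact_terminal = 1.0 + np.exp(-1.0)
```

The noise-free run only checks the drift discretization; with nonzero `diffusion`, `jump` and `lam` the same function produces sample paths of the closed-loop state.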
Note that the verification theorem (Theorem 4.1) assumes the existence of a solution to the system (4.11). However, the ODEs to be solved by $M(\cdot,\cdot)$ and $\bar M(\cdot,\cdot)$ do not have a symmetric structure, and the general solvability of this type of ODEs when $n > 1$ remains an outstanding open problem. In the next section we present two examples in the case $n = 1$. This case is important, especially in financial applications such as the mean-variance portfolio selection model. We also remark that a special feature of the case $n = 1$ is that the state $X(\cdot)$ is one-dimensional, and so are the unknowns $M(\cdot,\cdot)$, $\bar M(\cdot,\cdot)$, $\Upsilon(\cdot,\cdot)$ and $\varphi(\cdot,\cdot)$ of the system (4.11). This makes (4.11) easier to solve.

5 Some applications 

5.1 Mean-variance portfolio selection problem 

In this subsection, we discuss the continuous-time Markowitz mean-variance portfolio selection problem. We apply Theorem 4.1 to obtain a state feedback representation of an equilibrium control for the problem. In the absence of Poisson random jumps, this problem is discussed in [5].

The problem is formulated as follows. We consider a financial market in which two securities are traded continuously. One of them is a bond, with price $S^0(s)$ at time $s \in [0,T]$ governed by

\[
dS^0(s) = S^0(s)\,r(s)\,ds, \quad S^0(0) = s_0 > 0. \tag{5.1}
\]

There is also a stock with unit price $S^1(s)$ at time $s \in [0,T]$ governed by

\[
dS^1(s) = S^1(s-)\Big(\alpha(s)\,ds + \beta(s)\,dW(s) + \int_{\mathbb{R}^*}\gamma(s,z)\,\tilde N(ds,dz)\Big), \quad S^1(0) = s_1 > 0, \tag{5.2}
\]

where $r : [0,T] \to (0,\infty)$, $\alpha, \beta : [0,T] \to \mathbb{R}$ and $\gamma : [0,T] \times \mathbb{R}^{*} \to \mathbb{R}$ are assumed to be deterministic and
continuous, we also assume a uniform ellipticity condition as follow a (t)^ ^ 

some $\delta > 0$. For an investor, a portfolio $\pi(\cdot)$ is a process representing the amount of money invested in the stock. The wealth process $X(\cdot)$ corresponding to the initial capital $x_0 > 0$ and the portfolio $\pi(\cdot)$ then satisfies the equation

\[
\left\{
\begin{aligned}
dX(s) &= \big(r(s)X(s) + \pi(s)(\alpha(s) - r(s))\big)\,ds + \pi(s)\beta(s)\,dW(s) \\
&\quad + \pi(s)\int_{\mathbb{R}^*}\gamma(s,z)\,\tilde N(ds,dz), \quad s \in [0,T], \\
X(0) &= x_0.
\end{aligned}
\right. \tag{5.3}
\]

As time evolves, we need to consider the controlled stochastic differential equation parametrized by $(t,\xi) \in [0,T] \times \mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R})$ and satisfied by $X(\cdot)$:

\[
\left\{
\begin{aligned}
dX(s) &= \big(r(s)X(s) + \pi(s)(\alpha(s) - r(s))\big)\,ds + \pi(s)\beta(s)\,dW(s) \\
&\quad + \pi(s)\int_{\mathbb{R}^*}\gamma(s,z)\,\tilde N(ds,dz), \quad s \in [t,T], \\
X(t) &= \xi.
\end{aligned}
\right. \tag{5.4}
\]

The objective is to maximize the conditional expectation of the terminal wealth, $\mathbb{E}^t[X(T)]$, and at the same time to minimize the conditional variance of the terminal wealth, $\mathrm{Var}^t[X(T)]$, over controls $\pi(\cdot)$ valued in $\mathbb{R}$. The mean-variance portfolio optimization problem then consists in minimizing the cost $J(t,\xi,\cdot)$ given by

\[
J(t,\xi,\pi(\cdot)) = \frac{1}{2}\mathrm{Var}^t[X(T)] - (\mu_1(t)\xi + \mu_2(t))\,\mathbb{E}^t[X(T)], \tag{5.5}
\]

subject to (5.4). Here $\mu_1, \mu_2 : [0,T] \to (0,\infty)$ are deterministic, non-constant, continuous and bounded functions. The above model covers the one in [5]: in our case, the weight between the conditional variance and the conditional expectation depends on the current wealth level as well as on the current time, while in [5] the weight depends on the current wealth level only. Hence, in the above model, there are three different sources of time-inconsistency. Moreover, the above model is mathematically a special case of the general LQ problem formulated earlier in this paper, with $n = d = m = 1$, so we can apply Theorem 4.1 to obtain a Nash equilibrium control. We recall that the definition of equilibrium controls is in the open-loop sense, which is different from the feedback one in 0 , a and [35].
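The optimal control problem associated with (5.4) and (5.5) is equivalent to minimizing

\[
J(t,\xi,u(\cdot)) = \frac{1}{2}\Big(\mathbb{E}^t[X(T)^2] - \mathbb{E}^t[X(T)]^2\Big) - (\mu_1(t)\xi + \mu_2(t))\,\mathbb{E}^t[X(T)]
\]

subject to (5.4). Denote

\[
\rho(t) = \beta(t)^2 + \int_{\mathbb{R}^*}\gamma(t,z)^2\,\theta(dz).
\]

Thus, the system (4.11) reduces to

\[
\left\{
\begin{aligned}
& \frac{\partial M}{\partial s}(t,s) + \Big\{2r(s) - \frac{(\alpha(s) - r(s))^2}{M(s,s)\rho(s)}\big(M(s,s) + \bar M(s,s) + \Upsilon(s,s)\big)\Big\}M(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \bar M}{\partial s}(t,s) + \Big\{2r(s) - \frac{(\alpha(s) - r(s))^2}{M(s,s)\rho(s)}\big(M(s,s) + \bar M(s,s) + \Upsilon(s,s)\big)\Big\}\bar M(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \Upsilon}{\partial s}(t,s) + r(s)\Upsilon(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \varphi}{\partial s}(t,s) + r(s)\varphi(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& M(t,T) = 1, \quad \bar M(t,T) = -1, \quad \Upsilon(t,T) = \mu_1(t), \quad \varphi(t,T) = \mu_2(t), \quad \forall t \in [0,T].
\end{aligned}
\right. \tag{5.6}
\]

Clearly, if $M(\cdot,\cdot)$ and $\bar M(\cdot,\cdot)$ are solutions to the first and the second equations, respectively, in (5.6), then $\widetilde M(\cdot,\cdot) = (M + \bar M)(\cdot,\cdot)$ solves the following ODE

\[
\left\{
\begin{aligned}
& \frac{\partial \widetilde M}{\partial s}(t,s) + \Big\{2r(s) - \frac{(\alpha(s) - r(s))^2}{M(s,s)\rho(s)}\big(\widetilde M(s,s) + \Upsilon(s,s)\big)\Big\}\widetilde M(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \widetilde M(t,T) = 0, \quad t \in [0,T],
\end{aligned}
\right.
\]

which is equivalent to

\[
\widetilde M(t,s) = \widetilde M(t,T)\exp\Big\{\int_s^T\Big(2r(\tau) - \frac{(\alpha(\tau) - r(\tau))^2}{M(\tau,\tau)\rho(\tau)}\big(\widetilde M(\tau,\tau) + \Upsilon(\tau,\tau)\big)\Big)d\tau\Big\} = 0, \quad (t,s) \in \mathcal{D}[0,T],
\]

hence

\[
M(t,s) + \bar M(t,s) = \widetilde M(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T].
\]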

Moreover, we remark that none of the data of the ODEs to be solved by $M(\cdot,\cdot)$ and $\bar M(\cdot,\cdot)$ depend on $t$; thus (5.6) reduces to


T(s,s) = 0, VsG [0,T], 


M(s) = -M(s), Vs € [0,r], 

^ it, s)+r (s) T {t, s) = 0, V (t, s) GD [0, T], 

(t, s) + r (s) p {t, s) = 0, V (t, s) GD [0, T ], 

[ mV) = 1, T(t,r) = Mi(<), pit,T)=p2it), Vf€[0,T]. 

which is explicitly solved by 

M(s) = + ^ i^) , Vs€[0,r], 


M{s) = -e^ I ^ ^ / e- (t) 

T (f, s) = Ml it) el V {t, s)gD [0, T ], 


(g (t) -r(T))^ 

Pi^) 


dr L Vs G [0,r] 


I Pit,s) = M 2 it)eJs 


Is 


V(t,s)GD[0,r]. 


(5.8) 


(5.9) 
In view of Theorem 4.1, the representation (4.8) of the Nash equilibrium control then gives

\[
\hat u(s) = -\Psi(s)\hat X(s) - \psi(s), \quad \forall s \in [0,T],
\]

where, for all $s \in [0,T]$,

\[
\Psi(s) = \frac{\alpha(s) - r(s)}{M(s)\rho(s)}\,\Upsilon(s,s) \quad \text{and} \quad \psi(s) = \frac{\alpha(s) - r(s)}{M(s)\rho(s)}\,\varphi(s,s).
\]

The corresponding equilibrium dynamics solves the SDE

\[
\left\{
\begin{aligned}
d\hat X(s) &= \big\{(r(s) - \Psi(s)(\alpha(s) - r(s)))\hat X(s) - \psi(s)(\alpha(s) - r(s))\big\}\,ds \\
&\quad - \big(\Psi(s)\hat X(s) + \psi(s)\big)\Big\{\beta(s)\,dW(s) + \int_{\mathbb{R}^*}\gamma(s,z)\,\tilde N(ds,dz)\Big\}, \quad s \in [0,T], \\
\hat X(0) &= x_0.
\end{aligned}
\right.
\]


Remark 5.1. In the absence of Poisson random jumps, we have the following:

1. In the case where $\mu_1(t) = \mu_1 > 0$ and $\mu_2(t) = \mu_2 > 0$, the solution obtained for the mean-variance problem coincides with that obtained by [9].

2. In the case where $\mu_1(t) = \mu_1 > 0$ and $\mu_2(t) = 0$, the solution obtained coincides with that obtained by [5].

3. In the case where $\mu_1(t) = 0$ and $\mu_2(t) = \mu_2 > 0$, the solution obtained coincides with that obtained by n.


5.2 General discounting LQ regulator 

In this subsection, we consider an example of a general discounting time-inconsistent LQ model. The objective is to minimize the expected cost functional, incurred over a finite time horizon,

\[
J(t,\xi,u(\cdot)) = \frac{1}{2}\,\mathbb{E}^t\Big[\int_t^T |u(s)|^2\,ds + h(t)\,|X(T) - \xi|^2\Big], \tag{5.11}
\]

where $h(\cdot) : [0,T] \to (0,\infty)$ is a general deterministic non-exponential discount function satisfying $h(0) = 1$, $h(s) \ge 0$ and $\int_0^T h(t)\,dt < \infty$, subject to a controlled one-dimensional SDE parametrized by $(t,\xi) \in [0,T] \times \mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R})$:

\[
\left\{
\begin{aligned}
dX(s) &= \{aX(s) + bu(s)\}\,ds + \sigma\,dW(s) + c\int_Z z\,\tilde N(ds,dz), \quad s \in [t,T], \\
X(t) &= \xi,
\end{aligned}
\right. \tag{5.12}
\]

where $a$, $b$ and $c$ are real constants. As mentioned in [2], this is a time-inconsistent version of the classical linear quadratic regulator: we want to control the system so that the final state $X(T)$ is close to $\xi$, while at the same time keeping the control energy (formalized by the integral term) small. Note that here the time-inconsistency is due to the fact that the terminal cost depends explicitly on the current state $\xi$ as well as on the current time $t$. Hence there are two different sources of time-inconsistency. For this example, the system (4.11) reduces to

\[
\left\{
\begin{aligned}
& \frac{\partial M}{\partial s}(t,s) + 2aM(t,s) - b^2 M(t,s)\{M(s,s) + \bar M(s,s) + \Upsilon(s,s)\} = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \bar M}{\partial s}(t,s) + 2a\bar M(t,s) - b^2 \bar M(t,s)\{M(s,s) + \bar M(s,s) + \Upsilon(s,s)\} = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \Upsilon}{\partial s}(t,s) + a\Upsilon(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& \frac{\partial \varphi}{\partial s}(t,s) + a\varphi(t,s) - b^2\{M(t,s) + \bar M(t,s)\}\varphi(s,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
& M(t,T) = h(t), \quad \bar M(t,T) = 0, \quad \Upsilon(t,T) = h(t), \quad \varphi(t,T) = 0, \quad \forall t \in [0,T].
\end{aligned}
\right. \tag{5.13}
\]
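Obviously, $\Upsilon$ is explicitly given by

\[
\Upsilon(t,s) = h(t)\exp\{a(T - s)\}, \quad \forall (t,s) \in \mathcal{D}[0,T]. \tag{5.14}
\]

Moreover, we can check that $M(\cdot,\cdot)$, $\bar M(\cdot,\cdot)$ and $\varphi(\cdot,\cdot)$ solve (5.13) if and only if they solve the following system of integral equations

\[
\left\{
\begin{aligned}
M(t,s) &= M(t,T)\exp\Big\{\int_s^T \big(2a - b^2(M(\tau,\tau) + \bar M(\tau,\tau) + \Upsilon(\tau,\tau))\big)\,d\tau\Big\}, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
\bar M(t,s) &= \bar M(t,T)\exp\Big\{\int_s^T \big(2a - b^2(M(\tau,\tau) + \bar M(\tau,\tau) + \Upsilon(\tau,\tau))\big)\,d\tau\Big\}, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
\varphi(t,s) &= e^{a(T-s)}\varphi(t,T) - b^2\int_s^T e^{a(\tau - s)}\big(M(t,\tau) + \bar M(t,\tau)\big)\varphi(\tau,\tau)\,d\tau, \quad \forall (t,s) \in \mathcal{D}[0,T].
\end{aligned}
\right. \tag{5.15}
\]

On the other hand, we have $\bar M(t,T) = \varphi(t,T) = 0$, so (5.15) reduces to

\[
\left\{
\begin{aligned}
M(t,s) &= M(t,T)\exp\Big\{\int_s^T \big(2a - b^2(M(\tau,\tau) + \Upsilon(\tau,\tau))\big)\,d\tau\Big\}, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
\bar M(t,s) &= 0, \quad \forall (t,s) \in \mathcal{D}[0,T], \\
\varphi(t,s) &= -b^2\int_s^T e^{a(\tau - s)}M(t,\tau)\varphi(\tau,\tau)\,d\tau, \quad \forall (t,s) \in \mathcal{D}[0,T].
\end{aligned}
\right. \tag{5.16}
\]

It is clear that if $M(\cdot,\cdot)$ is the solution of the first equation in (5.16), then

\[
\varphi(s,s) = -b^2\int_s^T e^{a(\tau - s)}M(s,\tau)\varphi(\tau,\tau)\,d\tau, \quad \forall s \in [0,T],
\]

thus there exists some constant $L > 0$ such that $|\varphi(s,s)| \le L\int_s^T |\varphi(\tau,\tau)|\,d\tau$. By Gronwall's lemma, we conclude that $\varphi(s,s) = 0$ for all $s \in [0,T]$. Therefore

\[
\varphi(t,s) = 0, \quad \forall (t,s) \in \mathcal{D}[0,T],
\]

is the unique solution to the last equation in the system (5.16).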

It now remains to solve the first equation in the system (5.16). We have the following theorem.

Theorem 5.2. The first equation in (5.16) has a unique solution in $C(\mathcal{D}[0,T], \mathbb{R}_{+})$.

Proof. See the proof in Appendix A.2. ■
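
The fixed-point argument behind Theorem 5.2 also suggests a simple numerical scheme. The sketch below is an illustration under assumptions, not the authors' method: it takes the first equation in (5.16) as printed, with $\Upsilon(\tau,\tau) = h(\tau)e^{a(T-\tau)}$ from (5.14), restricts it to the diagonal $m(s) = M(s,s)$, and runs a Picard iteration on a uniform grid with trapezoidal quadrature. The function names, the grid size and the hyperbolic discount $h(t) = 1/(1+t)$ are hypothetical choices.

```python
import numpy as np

def tail_integral(g, ds):
    """Trapezoidal approximation of int_s^T g(tau) dtau at every grid point s."""
    seg = 0.5 * (g[1:] + g[:-1]) * ds
    return np.concatenate((np.cumsum(seg[::-1])[::-1], [0.0]))

def solve_diagonal(h, a, b, T, n=401, tol=1e-12, max_iter=200):
    """Picard iteration for m(s) = h(s) * exp(int_s^T (2a - b^2 (m + Upsilon)) dtau)."""
    s = np.linspace(0.0, T, n)
    ds = s[1] - s[0]
    hs = h(s)
    ups = hs * np.exp(a * (T - s))  # Upsilon(tau, tau) from (5.14)
    m = hs.copy()                   # initial guess
    for _ in range(max_iter):
        m_new = hs * np.exp(tail_integral(2.0 * a - b**2 * (m + ups), ds))
        converged = np.max(np.abs(m_new - m)) < tol
        m = m_new
        if converged:
            break
    return s, hs, ups, m

# Hypothetical data: hyperbolic discount h(t) = 1 / (1 + t), a = 0.1, b = 1, T = 1.
a, b, T = 0.1, 1.0, 1.0
s, hs, ups, m = solve_diagonal(lambda t: 1.0 / (1.0 + t), a, b, T)

# Residual of the discretized fixed-point equation on the diagonal.
residual = np.max(np.abs(m - hs * np.exp(tail_integral(2.0 * a - b**2 * (m + ups), s[1] - s[0]))))

def M_full(t_index, s_index):
    """M(t, s) = h(t) * m(s) / h(s), recovered from the diagonal values."""
    return hs[t_index] * m[s_index] / hs[s_index]
```

Once the diagonal values are known, the full function follows from $M(t,s) = h(t)\,m(s)/h(s)$, because the exponential factor in the first equation of (5.16) depends on $s$ only.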

In view of Theorem 4.1, the representation (4.8) of the Nash equilibrium control then gives

\[
\hat u(s) = b\{\Upsilon(s,s) - M(s,s)\}\hat X(s), \quad \forall s \in [0,T], \tag{5.19}
\]

and the corresponding equilibrium dynamics solves the SDE

\[
\left\{
\begin{aligned}
d\hat X(s) &= \{a + b^2(\Upsilon(s,s) - M(s,s))\}\hat X(s)\,ds + \sigma\,dW(s) + c\int_Z z\,\tilde N(ds,dz), \quad s \in [0,T], \\
\hat X(0) &= x_0.
\end{aligned}
\right.
\]

To conclude this section let us present the following remark. 

Remark 5.3. Problem (E), given in Subsection 2.3, is a particular case of the general discounting LQ regulator model formulated earlier in this subsection, obtained when $a = c = 0$ and the final datum is $\xi = x$. This leads to the following representation of the Nash equilibrium control of this problem:

\[
\hat u(s) = b\,(h(s) - M(s,s))\,\hat X(s), \quad \forall s \in [0,T],
\]

where $M(t,s)$ solves

\[
M(t,s) = h(t)\exp\Big\{-b^2\int_s^T (M(\tau,\tau) + h(\tau))\,d\tau\Big\}, \quad \forall (t,s) \in \mathcal{D}[0,T],
\]

and the corresponding equilibrium dynamics solves the SDE

\[
\left\{
\begin{aligned}
d\hat X(s) &= b^2\{h(s) - M(s,s)\}\hat X(s)\,ds + \sigma\,dW(s), \quad s \in [0,T], \\
\hat X(0) &= x_0.
\end{aligned}
\right.
\]

This is, in fact, the correct solution of Problem (E).
Conclusion and future work. In this paper, we have studied a class of dynamic decision problems of a general time-inconsistent type. We have used the game-theoretic approach to handle the time inconsistency. Open-loop Nash equilibrium controls are constructed as an alternative to optimal controls. This has been accomplished through a stochastic maximum principle that involves a flow of forward-backward stochastic differential equations under a maximum condition. The concrete examples illustrate the applicability of the approach. The work can be extended in several ways. For example, this approach can be extended to a mean-field game to construct decentralized strategies and obtain an estimate of their performance. Research on this topic is in progress and will appear in a forthcoming paper.


6 Appendix: Additional proofs 


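A.1. Proof of Lemma 3.3. Let $t \in [0,T]$, $v \in \mathbb{L}^2(\Omega,\mathcal{F}_t,\mathbb{P};\mathbb{R}^m)$ and $\varepsilon \in [0,T-t]$. Since $\mathbb{E}^t[y^{\varepsilon,v}(\cdot)]$ and $\mathbb{E}^t[z^{\varepsilon,v}(\cdot)]$ solve the following ODEs, respectively,

\[
\left\{
\begin{aligned}
d\,\mathbb{E}^t[y^{\varepsilon,v}(s)] &= A\,\mathbb{E}^t[y^{\varepsilon,v}(s)]\,ds, \quad s \in [t,T], \\
\mathbb{E}^t[y^{\varepsilon,v}(t)] &= 0,
\end{aligned}
\right.
\]

and

\[
\left\{
\begin{aligned}
d\,\mathbb{E}^t[z^{\varepsilon,v}(s)] &= \{A\,\mathbb{E}^t[z^{\varepsilon,v}(s)] + Bv\,\mathbf{1}_{[t,t+\varepsilon)}(s)\}\,ds, \quad s \in [t,T], \\
\mathbb{E}^t[z^{\varepsilon,v}(t)] &= 0,
\end{aligned}
\right.
\]

it is clear that $\mathbb{E}^t[y^{\varepsilon,v}(s)] = 0$ for a.e. $s \in [t,T]$. According to Gronwall's inequality, there exists a positive constant $K$ such that $\sup_{s \in [t,T]} |\mathbb{E}^t[z^{\varepsilon,v}(s)]|^2 \le K\varepsilon^2$. Moreover, by Lemma 2.1 in [T7], we obtain (3.18).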

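With these estimates, we can calculate the difference

\[
\begin{aligned}
& J(t,\hat X(t),u^{\varepsilon}(\cdot)) - J(t,\hat X(t),\hat u(\cdot)) \\
&= \mathbb{E}^t\Big[\int_t^T \Big\{\big\langle Q(t,s)\hat X(s) + \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)],\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\big\rangle \\
&\qquad + \tfrac{1}{2}\big\langle Q(t,s)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)),\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\big\rangle \\
&\qquad + \tfrac{1}{2}\big\langle \bar Q(t,s)\,\mathbb{E}^t[y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)],\, \mathbb{E}^t[y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)]\big\rangle \\
&\qquad + \langle R(t,s)\hat u(s), v\rangle\,\mathbf{1}_{[t,t+\varepsilon)}(s) + \tfrac{1}{2}\langle R(t,s)v, v\rangle\,\mathbf{1}_{[t,t+\varepsilon)}(s)\Big\}\,ds \\
&\qquad + \tfrac{1}{2}\big\langle G(t)(y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle \\
&\qquad + \big\langle G(t)\hat X(T) + \bar G(t)\,\mathbb{E}^t[\hat X(T)] + \mu_1(t)\hat X(t) + \mu_2(t),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle \\
&\qquad + \tfrac{1}{2}\big\langle \bar G(t)\,\mathbb{E}^t[y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)],\, \mathbb{E}^t[y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)]\big\rangle\Big].
\end{aligned} \tag{6.1}
\]

On the other hand, from (H1) and (3.17)-(3.18), the following estimate holds:

\[
\begin{aligned}
& \mathbb{E}^t\Big[\int_t^T \tfrac{1}{2}\big\langle \bar Q(t,s)\,\mathbb{E}^t[y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)],\, \mathbb{E}^t[y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)]\big\rangle\,ds \\
&\qquad + \tfrac{1}{2}\big\langle \bar G(t)\,\mathbb{E}^t[y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)],\, \mathbb{E}^t[y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)]\big\rangle\Big] = o(\varepsilon).
\end{aligned}
\]

Then, from the terminal conditions in the adjoint equations, it follows that

\[
\begin{aligned}
& J(t,\hat X(t),u^{\varepsilon}(\cdot)) - J(t,\hat X(t),\hat u(\cdot)) \\
&= \mathbb{E}^t\Big[\int_t^T \Big\{\big\langle Q(t,s)\hat X(s) + \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)],\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\big\rangle \\
&\qquad + \tfrac{1}{2}\big\langle Q(t,s)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)),\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\big\rangle \\
&\qquad + \langle R(t,s)\hat u(s), v\rangle\,\mathbf{1}_{[t,t+\varepsilon)}(s) + \tfrac{1}{2}\langle R(t,s)v, v\rangle\,\mathbf{1}_{[t,t+\varepsilon)}(s)\Big\}\,ds \\
&\qquad - \big\langle p(T;t),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle - \tfrac{1}{2}\big\langle P(T;t)(y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle\Big] + o(\varepsilon).
\end{aligned} \tag{6.2}
\]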
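Now, by applying Itô's formula to $s \mapsto \langle p(s;t),\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\rangle$ on $[t,T]$, we get

\[
\begin{aligned}
& \big\langle p(T;t),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle \\
&= \int_t^T \Big\{(Bv)^{\top}p(s;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s) + (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}\big(Q(t,s)\hat X(s) + \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)]\big) \\
&\qquad + \sum_{j=1}^{d}(D_j v)^{\top}q_j(s;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s) + \int_Z (F(z)v)^{\top}r(s,z;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,\theta(dz)\Big\}\,ds \\
&\quad + \sum_{j=1}^{d}\int_t^T \Big\{\big(C_j(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) + D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\big)^{\top}p(s;t) + (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}q_j(s;t)\Big\}\,dW^j(s) \\
&\quad + \int_t^T\!\!\int_Z \Big\{\big(E(z)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) + F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\big)^{\top}p(s;t) \\
&\qquad + (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}r(s,z;t)\Big\}\,\tilde N(ds,dz).
\end{aligned} \tag{6.3}
\]

By applying Itô's formula to $s \mapsto \langle P(s;t)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)),\, y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)\rangle$ on $[t,T]$, we get

\[
\begin{aligned}
& \big\langle P(T;t)(y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle \\
&= \int_t^T \Big\{2(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}P(s;t)Bv\,\mathbf{1}_{[t,t+\varepsilon)}(s) + (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}Q(t,s)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) \\
&\qquad + \sum_{j=1}^{d}\Big(2(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}C_j^{\top}P(s;t)D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s) + v^{\top}D_j^{\top}P(s;t)D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\Big) \\
&\qquad + \int_Z \Big(2(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}E(z)^{\top}P(s;t)F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s) \\
&\qquad\qquad + v^{\top}F(z)^{\top}P(s;t)F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\Big)\,\theta(dz)\Big\}\,ds \\
&\quad + 2\sum_{j=1}^{d}\int_t^T (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}P(s;t)\big(C_j(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) + D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\big)\,dW^j(s) \\
&\quad + 2\int_t^T\!\!\int_Z (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}P(s;t)\big(E(z)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) + F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\big)\,\tilde N(ds,dz).
\end{aligned} \tag{6.4}
\]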

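Moreover, we conclude from (H1) together with (3.17)-(3.18) that

\[
\left\{
\begin{aligned}
& \mathbb{E}^t\Big[\int_t^T (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}P(s;t)Bv\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,ds\Big] = o(\varepsilon), \\
& \mathbb{E}^t\Big[\int_t^T (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}C_j^{\top}P(s;t)D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,ds\Big] = o(\varepsilon), \\
& \mathbb{E}^t\Big[\int_t^T\!\!\int_Z (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}E(z)^{\top}P(s;t)F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,\theta(dz)\,ds\Big] = o(\varepsilon).
\end{aligned}
\right. \tag{6.5}
\]

By taking the conditional expectation in (6.3) and (6.4), and then invoking (6.5), it holds that

\[
\begin{aligned}
& \mathbb{E}^t\big[\big\langle p(T;t),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle\big] \\
&= \mathbb{E}^t\Big[\int_t^T \Big\{v^{\top}B^{\top}p(s;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s) + (y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}\big(Q(t,s)\hat X(s) + \bar Q(t,s)\,\mathbb{E}^t[\hat X(s)]\big) \\
&\qquad + \sum_{j=1}^{d}v^{\top}D_j^{\top}q_j(s;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s) + \int_Z v^{\top}F(z)^{\top}r(s,z;t)\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,\theta(dz)\Big\}\,ds\Big]
\end{aligned} \tag{6.6}
\]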
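and

\[
\begin{aligned}
& \tfrac{1}{2}\,\mathbb{E}^t\big[\big\langle P(T;t)(y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)),\, y^{\varepsilon,v}(T) + z^{\varepsilon,v}(T)\big\rangle\big] \\
&= \tfrac{1}{2}\,\mathbb{E}^t\Big[\int_t^T \Big\{(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s))^{\top}Q(t,s)(y^{\varepsilon,v}(s) + z^{\varepsilon,v}(s)) + \sum_{j=1}^{d}v^{\top}D_j^{\top}P(s;t)D_j v\,\mathbf{1}_{[t,t+\varepsilon)}(s) \\
&\qquad + \int_Z v^{\top}F(z)^{\top}P(s;t)F(z)v\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,\theta(dz)\Big\}\,ds\Big] + o(\varepsilon).
\end{aligned} \tag{6.7}
\]

By substituting (6.6) and (6.7) into (6.2), it follows that

\[
\begin{aligned}
& J(t,\hat X(t),u^{\varepsilon}(\cdot)) - J(t,\hat X(t),\hat u(\cdot)) \\
&= -\mathbb{E}^t\Big[\int_t^T \Big\{v^{\top}B^{\top}p(s;t) + \sum_{j=1}^{d}v^{\top}D_j^{\top}q_j(s;t) + \tfrac{1}{2}\sum_{j=1}^{d}v^{\top}D_j^{\top}P(s;t)D_j v \\
&\qquad - v^{\top}R(t,s)\hat u(s) - \tfrac{1}{2}v^{\top}R(t,s)v \\
&\qquad + \int_Z \Big(v^{\top}F(z)^{\top}r(s,z;t) + \tfrac{1}{2}v^{\top}F(z)^{\top}P(s;t)F(z)v\Big)\,\theta(dz)\Big\}\,\mathbf{1}_{[t,t+\varepsilon)}(s)\,ds\Big] + o(\varepsilon),
\end{aligned}
\]

which is equivalent to (3.19). ■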

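A.2. Proof of Theorem 5.2. For a constant $\beta > 0$, to be fixed later, we introduce the following norm, for $f(\cdot,\cdot) \in C(\mathcal{D}[0,T],\mathbb{R})$:

\[
\|f\|_{\infty,\beta} = \sup_{(t,s) \in \mathcal{D}[0,T]} e^{-\beta(T-s)}\,|f(t,s)|.
\]

It is easy to check that $e^{-\beta T}\|f\|_{\infty} \le \|f\|_{\infty,\beta} \le \|f\|_{\infty}$ for every $f \in C(\mathcal{D}[0,T],\mathbb{R})$; hence the norm $\|\cdot\|_{\infty,\beta}$ is equivalent to $\|\cdot\|_{\infty}$ on the Banach space $C(\mathcal{D}[0,T],\mathbb{R})$. We introduce the following nonlinear operator $L[\cdot] : C(\mathcal{D}[0,T],\mathbb{R}_{+}) \to C(\mathcal{D}[0,T],\mathbb{R}_{+})$, such that for all $f(\cdot,\cdot) \in C(\mathcal{D}[0,T],\mathbb{R}_{+})$ we have

\[
L[f](t,s) = h(t)\exp\Big\{\int_s^T \big(2a - b^2\Upsilon(\tau,\tau) - b^2 f(\tau,\tau)\big)\,d\tau\Big\}.
\]

Then our problem is equivalent to a fixed-point problem for the operator $L[\cdot]$ in the closed subset $C(\mathcal{D}[0,T],\mathbb{R}_{+})$ of the Banach space $\big(C(\mathcal{D}[0,T],\mathbb{R}), \|\cdot\|_{\infty,\beta}\big)$.

1) Existence of a solution. It is clear that $L[\cdot]$ is well defined. Now, consider $f_1, f_2 \in C(\mathcal{D}[0,T],\mathbb{R}_{+})$; then

\[
L[f_1](t,s) - L[f_2](t,s) = h(t)\,e^{\int_s^T (2a - b^2\Upsilon(\tau,\tau))\,d\tau}\Big(e^{-b^2\int_s^T f_1(\tau,\tau)\,d\tau} - e^{-b^2\int_s^T f_2(\tau,\tau)\,d\tau}\Big). \tag{6.8}
\]

We put $A(t,s) = h(t)\,e^{\int_s^T (2a - b^2\Upsilon(\tau,\tau))\,d\tau}$ for $(t,s) \in \mathcal{D}[0,T]$; obviously $A(\cdot,\cdot)$ is uniformly bounded. Then there exists some constant $K > 0$ such that

\[
|L[f_1](t,s) - L[f_2](t,s)| \le K\,\Big|e^{-b^2\int_s^T f_1(\tau,\tau)\,d\tau} - e^{-b^2\int_s^T f_2(\tau,\tau)\,d\tau}\Big|.
\]

Moreover, since $|e^{-x} - e^{-y}| \le |x - y|$ for all $x, y \in [0,\infty)$, we have

\[
|L[f_1](t,s) - L[f_2](t,s)| \le Kb^2\int_s^T |f_1(\tau,\tau) - f_2(\tau,\tau)|\,d\tau, \tag{6.9}
\]

thus

\[
\begin{aligned}
e^{-\beta(T-s)}\,|L[f_1](t,s) - L[f_2](t,s)| &\le Kb^2\,e^{-\beta(T-s)}\int_s^T e^{\beta(T-\tau)}\,e^{-\beta(T-\tau)}\,|f_1(\tau,\tau) - f_2(\tau,\tau)|\,d\tau \\
&\le Kb^2\,\|f_1 - f_2\|_{\infty,\beta}\;e^{-\beta(T-s)}\int_s^T e^{\beta(T-\tau)}\,d\tau \\
&= \frac{Kb^2\big(1 - e^{-\beta(T-s)}\big)}{\beta}\,\|f_1 - f_2\|_{\infty,\beta},
\end{aligned}
\]

hence

\[
\|L[f_1] - L[f_2]\|_{\infty,\beta} \le \frac{Kb^2\big(1 - e^{-\beta T}\big)}{\beta}\,\|f_1 - f_2\|_{\infty,\beta}.
\]

Therefore $L[\cdot]$ is a contraction mapping for $\beta$ large enough.

2) Uniqueness of the solution. Let $f_1, f_2 \in C(\mathcal{D}[0,T],\mathbb{R}_{+})$ be two solutions; then

\[
f_1(t,s) = L[f_1](t,s) \quad \text{and} \quad f_2(t,s) = L[f_2](t,s), \quad \forall (t,s) \in \mathcal{D}[0,T].
\]

From (6.9) we have

\[
|f_1(s,s) - f_2(s,s)| \le Kb^2\int_s^T |f_1(\tau,\tau) - f_2(\tau,\tau)|\,d\tau, \quad \forall s \in [0,T],
\]

therefore, by Gronwall's lemma, we conclude that $|f_1(s,s) - f_2(s,s)| = 0$ for all $s \in [0,T]$, hence

\[
\begin{aligned}
f_1(t,s) = L[f_1](t,s) &= h(t)\exp\Big\{\int_s^T \big(2a - b^2\Upsilon(\tau,\tau) - b^2 f_1(\tau,\tau)\big)\,d\tau\Big\} \\
&= h(t)\exp\Big\{\int_s^T \big(2a - b^2\Upsilon(\tau,\tau) - b^2 f_2(\tau,\tau)\big)\,d\tau\Big\} \\
&= f_2(t,s), \quad \forall (t,s) \in \mathcal{D}[0,T].
\end{aligned}
\]

This completes the proof.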